An  Implementation  of  Process  Swapping  in  MINK 
(A  Message  Passing  Oriented  Operating  System)  , 


by 
Stanley  George  Kobylanski 

B.S.,  Pennsylvania  State  University,  1972 


A  MASTER'S  REPORT 

submitted  in  partial  fulfillment  of  the 

requirements  for  the  degree 

MASTER  OF  SCIENCE 

Department  of  Computing  and  Information  Sciences 


KANSAS  STATE  UNIVERSITY 
Manhattan,  Kansas 


1989 


Approved  by: 


c/tftxbr^/l-  <^Wl4u>ti) 


Major  Professor 


k*,3 

CZ  A112D&   317bftu 

CONTENTS 

1.  INTRODUCTION       1 

1.1  Purpose  of  Operating  Systems       1 

1.2  MTNIX  Operating  System 2 

1.2.1   Introduction 2 

12.2  MTNIX  and  UNIX 3 

1.2.2.1  General  Description 3 

1.2.2.2  UNIX  Description       4 

1.2.2.3  MTNIX  Description 6 

1.2.3  MTNIX  Limitations 9 

1.2.3.1   MTNIX  Problem 10 

2.  REQUIREMENTS 12 

2.1  Extend  Memory 12 

2.2  Maintain  Existing  Functions 12 

2.3  Maintain  MTNIX  Structure       12 

2.4  User  Administration       13 

2.5  Performance 13 

3.  DETAILED  DESIGN        14 

3.1  Introduction 14 

3.2  Process  Swapping 14 

3.2.1  Purpose  of  Swapping 14 

3.2.2  Swapping  Functions       15 

3.2.3  Alternatives  to  Swapping 15 

3.2.3.1  Increased  Memory       15 

3.2.3.2  Demand  Paging 15 

3.3  General  Design 16 

3.3.1  Introduction 16 

3.3.2  Overview 18 

3.3.3  Swapping  Functions       20 

3.3.3.1  Swap  Device 20 

3.3.3.2  Swapping  Out  Processes 20 

3.3.3.3  Swapping  In  Processes 26 

3.4  Detailed  Module  Description 27 

3.4.1  Introduction 28 

3.4.2  Kernel  -  Management  and  Decisions        28 

3.4.2.1   Swap  Task 28 


i  - 


3.4.2.1.1  Swap  Task  Stales 30 

3.4.2.1.2  Swap  Task  Transitions 30 

3.4.2.1.3  Swap  Out  Algorithm 33 

3.4.2.1.4  Swap  In  Algorithm 35 

3.4.2.2  Clock  Task 35 

3.4.2.3  System  Task 36 

3.4.3  Memory  Manager  -  Coordination       37 

3.4.3.1  Foik 37 

3.4.3.2  Exec 38 

3.4.3.3  Paused  Process  Handling 39 

3.4.3.4  Wait  Process  Handling 39 

3.4.3.5  Signal  Handling 40 

3.4.3.6  Freed  Memory  Handling 40 

3.4.3.7  Swap  Out  Function 41 

3.4.3.8  Swap  In  Function 41 

3.4.4  File  System  -  Swap  I/O 42 

3.4.5  Swap  Device 42 

IMPLEMENTATION       44 

4.1  Introduction 44 

4.2  Kernel  Code 44 

4.2.1  clockx 44 

4.2.1.1   doclocktick 44 

4.2.2  proc.c 45 

4.2.2.1   minirec       45 

4.2.3  swapper.c 45 

4.2.3.1  swaptask 45 

4.2.3.2  trytoswin 46 

4.2.3.3  pikinsw 47 

4.2.3.4  pikoutsw 47 

4.2.3.5  setswapproc 47 

4.2.4  system.c 48 

4.2.4.1  dojork 48 

4.2.4.2  donewmap       48 

4.2.4.3  doexec 48 

4.2.4.4  doxit 49 

4.2.4.5  dojock 49 

4.2.4.6  inform 49 

4.3  MM  Code 49 


4.3.1  alloc.c 49 

4.3.1.1  allocmem 49 

4.3.1.2  totjxjle 49 

4.3.1.3  compact 50 

4.3.2  exec.c 50 

4.3.2.1  do_exec 50 

4.3.2.2  newmem 52 

4.3.2.3  load_seg       52 

4.3.3  forkexitx 52 

4.3.3.1  dojbrk 52 

4.3.3.2  mmexit       53 

4.3.3.3  do_wait 53 

4.3.4  main.c 54 

4.3.4.1  main 54 

4.3.4.2  reply 54 

4.3.5  mswap.c 54 

4.3.5.1  doswout 54 

4.3.5.2  swapout 55 

4.3.5.3  doswin 55 

4.3.5.4  swapin 57 

4.3.6  signal.c 57 

4.3.6.1  checksig 57 

4.3.6.2  unpause 57 

4.3.6.3  do_pause 57 

4.3.6.4  dumpcore 58 

4.4  FSCode 58 

4.4.1  main.c 58 

4.4.1.1   fsinit 59 

4.4.2  stadir.c       59 

4.4.2.1  do_chdir       59 

4.4.2.2  change 59 

5.   CONCLUSIONS 60 

5.1  Results       60 

5.2  Improvements  and  Further  Development 61 

BIBLIOGRAPHY 63 

APPENDIX  A  -  MODIFICATIONS  TO  KERNEL  CODE A-l 

APPENDIX  B  -  MODIFICATIONS  TO  MEMORY  MANAGER  CODE    ....  B-l 


■iii 


APPENDIX  C  -  MODIFICATIONS  TO  FILE  SYSTEM  CODE        C-l 


LIST  OF  FIGURES 

Figure  1-1.  Architecture  of  UNIX  Operating  System 5 

Figure  1-2.  Architecture  of  MINTX  Operating  System 7 

Figure  3-1.  MINIX  Process  Swapping  Overview       17 

Figure  3-2.  Original  MINIX  User  Process  States       21 

Figure  3-3.  MINIX  User  Process  States  with  Swapping 22 

Figure  3-4.  Illustration  of  Compaction 27 

Figure  3-5.  State  Diagram  for  Swap  Task 29 


LIST  OF  TABLES 
TABLE  5-1.   MINIX  NON-SWAPPING  vs  SWAPPING  BENCHMARKS  ....       62 


ACKNOWLEDGEMENTS 


My  wife,  Janine,  and  my  son,  Abraham,  have  given  me  more  love  and  support  within  the 
past  year  than  any  man  deserves.  It  is  impossible  to  return  so  much,  but  I  will  try. 

I  would  like  to  thank  Dr.  Virgil  Wallenrine  and  Dr.  Maarten  Van  Swaay  for  serving  on  my 
committee  and  for  exposing,  to  me,  a  small  portion  of  their  wealth  of  knowledge  and 
wisdom. 

I  would  also  like  to  thank  my  advisor,  Dr.  Masaaki  Mizuno,  for  sharing  his  theoretical 
knowledge,  his  technical  expertise,  and  his  humor.  His  guidance  has  kept  me  within  the 
bounds  of  the  project  while  allowing  me  the  latitude  to  explore  until  I  could  reach  and 
defend  my  own  conclusions. 

Finally,  I  would  like  to  thank  Charles  Clouse,  AT&T  Document  Development  Organization, 
for  his  help  in  formatting  this  document. 


CHAPTER  1 

1.  INTRODUCTION 

1.1  Purpose  of  Operating  Systems 

A  computer  system  consists  of  hardware  and  software  that  cooperate  to  do  useful  work. 
Computer  hardware  consists  of  the  physical  devices  that  you  can  see  and  touch,  such  as 
memory  circuits,  disk  drives,  tape  drives,  terminals,  printers,  light  pens,  keyboards,  etc. 
They  provide  most  of  the  key  computer  resources:  storage  for  data,  computing  power  for 
processing  data,  and  devices  for  input  and  output  of  data.  Another  key  resource  is  the  data 
itself. 

Computer  software,  often  referred  to  as  computer  programs,  consists  of  a  sequence  of 
logical  instructions  that  the  computer  hardware  performs  to  achieve  a  desired  result. 
Software  and  hardware,  teamed  to  form  a  computer  system,  have  the  ability  to  do  various 
functions,  from  check  book  balancing  to  highly  complex  programs  that  can  mimic  some 
parts  of  human  behavior. 

An  operating  system  is  a  computer  program  that: 

•  manages  the  resources  of  a  computer,  and 

•  provides  easy  access  to  complex  hardware. 

An  operating  system  allows  expensive  hardware  resources  to  be  shared  by  many  users.   For 


example,  a  laser  printer  that  provides  high  quality  printing  might  be  too  expensive  for  a 
single  user,  but  can  be  affordable  when  the  cost  is  shared  by  many  users.  Hardware 
resources  can  also  be  hard  to  use  because  of  their  complex  interfaces.  An  operating  system 
hides  the  hardware  complexity  by  converting  user  requests  so  that  the  hardware  can  accept 
them. 

Hardware  provides  "raw  computing  power"  and  operating  systems  make  this  power 
conveniently  available  to  users. 

1.2   MINIX  Operating  System 

1.2.1  Introduction  There  are  many  different  operating  systems,  each  built  to  satisfy 
certain  requirements.  MINIX  is  a  general  purpose,  multiprocessing,  multiuser,  operating 
system  designed  to  serve  as  an  aid  in  teaching  operating  system  concepts.  To  that  end,  it 
was  designed  to: 

•  be  small,  so  that  it  is  not  overwhelming  and  can  be  understood  by  a  student 

•  provide  an  outward  appearance  (user  interface)  that  mimics  a  popular  operating  system 
called  version  7  (v7)  UNIX1  (hence  it's  name  Mini  uNIX).  MINIX  includes  most  of  the 
system  calls  (basic  operating  system  commands),  features,  and  supporting  programs 
provided  by  UNIX. 


1 .    UNIX  is  a  trademark  of  AT&T. 


•  be  modular  to  aid  in  comprehension  and  to  encourage  modification. 

•  run  primarily  on  the  IBM-PC  and  most  compatibles  but  it  has  been  ported  to  other 
machines  (e.g.,  ATARI-ST). 

•  support  multiple  users.  However,  the  processing  power  of  the  IBM-PC  (Intel  8088)  is 
limited  such  that  only  1  user  can  comfortably  be  supported.  On  processors  more 
powerful  than  the  IBM-PC,  such  as  the  IBM-AT  (Intel  80286)  or  Intel  80386,  more  than 
one  user  can  be  supported. 

1.2.2   M1NIX  and  UNIX 

1.2.2.1  General  Description  A  brief  description  of  MINIX  and  UNIX  is  provided  here, 
for  more  information  see  references  [1]  and  [2].  MINIX  and  UNIX  consist  of  four  major 
components:  process  control,  memory  management,  file  system,  and  input/output. 

•  Process  control  handles  process  creation,  communication,  scheduling,  termination,  and 
miscellaneous  process  services  (suspension,  resumption,  memory  growth,  etc.).  In 
MINIX  and  UNIX,  this  function  is  provided  by  the  kernel. 

•  Memory  management  allocates  and  deallocates  main  memory  as  needed  by  processes. 
In  systems  that  provide  swapping  or  paging,  it  manages  the  transfer  of  process  images  to 
and  from  secondary  memory  as  well  as  allocation  and  deallocation  of  secondary 
memory.  In  MINIX,  this  function  is  provided  by  the  memory  manager  (MM).  The 
UNIX  kernel  provides  this  function. 


•  File  system  manages  secondary  memory  for  efficient  storage  and  retrieval  of  user  data. 
In  MINIX,  this  function  is  provided  by  the  file  system  (FS).  The  UNIX  kernel  provides 
this  function. 

•  Input  and  output  procedures  provide  controlled  access  to  peripheral  devices  such  as 
terminals,  tape  drives,  disk  drives,  and  network  devices  for  user  processes.  In  MINIX 
and  UNIX,  these  functions  are  provided  by  device  drivers  (also  known  as  tasks  in 
MINIX)  that  are  linked  with  the  kernel. 

1.2.2.2  UNIX  Description  Unix  was  designed  in  the  early  1970's  to  support  general 
computer  science  research  and  to  provide  a  custom  work  environment  for  its  creators.  It 
has  evolved  from  a  custom  work  environment  to  a  popular  genera!  purpose  operating 
system. 

UNIX  is  an  example  of  a  layered  operating  system.  Its  high  level  architecture  is  shown  in 
Figure  1-1.  The  rings  in  the  figure  represent  levels  of  interaction  and  privilege.  A  program 
in  a  ring  can  only  interact  with  or  get  services  from  programs  in  adjacent  rings.  For 
example,  the  cc  program  cannot  directly  interact  with  (or  request  services  from)  the  kernel. 

Only  the  operating  system  (kernel)  interacts  directly  with  the  hardware,  providing  common 
services  to  programs  and  insulating  them  from  the  hardware  idiosyncrasies.  Programs 
interact  with  the  kernel  by  invoking  a  well  defined  set  of  system  calls.  An  advantage  of 
this  design  is  that  the  rings  can  be  extended  as  much  as  the  user  prefers. 

Within  the  kernel,  there  is  no  structure  nor  information  hiding.   It  consists  of  a  collection  of 


Figure  1-1.  Architecture  of  UNIX  Operating  System 
procedures  that  are  compiled  into  a  single  object  file.    System  calls  from  user  processes 

execute  a  special  instruction  called  a  trap.  This  instruction  switches  the  machine  from  user 

mode  to  kernel  (or  supervisor)  mode.   Control  is  transferred  to  the  kernel  which  acts  on 

behalf  of  the  user  process.  The  kernel  is  not  a  separate  set  of  processes  that  run  in  parallel 

with  user  processes,  but  it  is  a  part  of  each  user  process.    In  effect,  the  user  process 

becomes  the  kernel  to  perform  the  protected  operating  system  functions  in  the  kernel  mode. 

When  the  system  call  completes,  the  machine  is  switched  back  to  user  mode  and  the 


process  resumes  in  user  mode. 

As  an  example,  consider  the  open  system  call  which  prepares  the  user  process  to  access  a 
file  for  reading  and/or  writing.  The  user  process  issues  the  system  call,  in  the  C 
programming  language,  as: 

fd  =  open(path,mode); 

where  fd  is  the  file  descriptor  used  for  future  file  access,  path  is  the  path  and  name  of  the 
file  to  be  opened,  and  mode  is  the  access  permission:  read,  write,  or  both.  The  open  system 
call  has  an  entry  point  in  the  system  call  library.  The  library,  encoded  in  assembly 
language,  contains  special  trap  instructions,  which  when  executed  cause  an  "interrupt"  that 
results  in  a  hardware  switch  from  user  mode  to  kernel  mode.  For  each  user  process  in 
UNIX,  there  exists  a  user  stack  for  use  in  user  mode  and  a  kernel  stack  for  use  in  kernel 
mode.  The  switch  to  kernel  mode  causes  a  switch  to  the  kernel  stack  and  allows  the  user 
process  to  execute  the  kernel  procedures  for  opening  a  file.  Without  getting  into  great  detail 
about  the  UNIX  file  system,  the  file,  if  found,  is  checked  for  access  permissions  for  this 
user,  prepared  for  access,  and  the  file  descriptor  is  returned  to  the  system  call  library  open 
routine.  The  system  call  library  executes  another  trap  instruction  that  results  in  a  hardware 
switch  back  to  the  user  mode.  The  file  descriptor  is  returned  to  the  user  process  which 
resumes  in  user  mode. 

1.2.2.3  MINIX  Description  MINIX  is  an  example  of  a  client-server  system.  Although  it 
provides  the  version  7  UNIX  user  interface,  the  internal  structure  and  operation  are  different 


than  UNIX.  In  a  client-server  (or  message  passing)  model,  such  as  MINIX,  as  much 
functionality  as  possible  is  removed  from  the  operating  system  leaving  a  minimal  kernel. 
Most  of  the  operating  system  functions  are  contained  in  user  processes.  To  request  a 
service  (system  call),  a  user  process  (known  as  a  client  process),  sends  the  request  to  a 
server  process  which  does  the  work  and  returns  the  results.  The  kernel  handles  the 
communication  between  the  clients  and  servers.  The  servers  run  as  user  mode  processes 
and  do  not  have  access  to  the  hardware.  The  structure  of  MINIX  is  shown  in  Figure  1-2. 
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Figure  1-2.  Architecture  of  MINIX  Operating  System 
The  layers  are  similar  to  the  UNIX  architecture  in  that  a  program  can  only  interact  with 

adjacent  layers. 


Layers  1  and  2  are  compiled  into  1  program  called  the  kernel.  The  process  management 
layer  handles  all  interrupts  and  traps,  and  provides  message  communication  between  all 
processes.    Layer  2  consists  of  input/output  processes,  typically  called  tasks  or  device 


drivers.  Although  linked  together,  they  run  as  separate  processes.  They  provide  the  device 
dependent  services  (physical  I/O  with  specific  hardware  devices). 

Layer  3  contains  two  processes  that  support  all  system  calls  for  user  processes.  Layer  3 
processes  provide  the  device  independent  services  (logical  I/O  common  to  all  devices). 

Layer  4  contains  all  user  processes.  System  calls  from  user  processes  are  handled  as  a 
client-server  transaction.  A  user  process  requests  a  system  service  by  SENDING  the  request 
to  the  appropriate  server  (FS  or  MM)  and  waiting  for  the  results.  The  server  RECEIVES 
the  request,  performs  the  service  (which  might  involve  communications  with  other  servers) 
and  returns  the  results.  Then  the  user  process  resumes. 

For  comparison  with  UNIX  operation,  a  MINIX  open  system  call  is  described.  A  user 
process  executes  the  open  system  call  using  the  same  syntax  and  interface  as  UNIX.  The 
system  call  library  function  converts  the  call  to  SEND  and  RECEIVE  statements.  These  are 
MINIX  communication  primitives.  The  library  function  then  executes  a  trap  instruction  that 
causes  a  software  interrupt  that  is  serviced  by  the  kernel.  The  MINIX  kernel  puts  the 
SEND  and  RECEIVE  requests  on  an  internal  message  queue  for  FS.  The  user  process  (via 
the  library  function)  is  then  blocked  until  both  requests  complete.  When  FS  issues  a 
RECEIVE  request  to  accept  new  work,  the  MINIX  kernel  delivers  the  user  process  message 
to  FS.  FS  copies  the  message  to  its  data  region,  determines  that  it  is  an  open  system  call, 
and  begins  to  perform  the  function.  All  data  relating  to  the  file  system  is  maintained  within 
FS.  It  is  not  directly  accessible  by  any  other  process,  including  the  kernel.  When  FS 
completes  the  function,  the  results  are  returned  using  the  SEND  request  with  the  user 


process  as  the  destination.  The  user  process  (via  the  library  function)  blocked  on 
RECEIVE,  accepts  the  results  immediately,  freeing  the  FS  SEND  block.  FS  then  issues  a 
RECEIVE  request  for  ANYone  and  awaits  new  work.  When  the  user  process  is  scheduled 
to  use  the  CPU,  it  proceeds  from  the  reception  of  the  system  call  results. 

Note  the  difference  in  system  call  handling  between  UNIX  and  MINTX.  In  UNIX,  the  user 
process  performs  the  system  call  within  the  boundaries  of  the  kernel  procedure.  In  MINIX, 
system  calls  are  performed  by  independent  server  processes  for  the  waiting  user  process 
(client).  This  has  implications  on  the  design  presented  later  in  this  document. 

An  empirical  study  of  message  oriented  operating  systems  versus  procedure  oriented 
operating  systems  is  provided  in  reference  [3].  MINIX  fits  the  definition  of  a  message 
oriented  system  and  UNIX  somewhat  fits  the  definition  of  a  procedure  oriented  system.  The 
study  demonstrates  that  those  two  types  of  operating  systems  are  duals  of  each  other  and 
that  a  system  constructed  according  to  one  model  has  a  direct  counterpart  in  the  other.  It 
concludes  that  neither  model  is  inherently  preferable,  the  main  consideration  being  the 
nature  of  the  machine  architecture,  not  the  application  that  the  system  will  support. 

1.2.3  MINIX  Limitations  MINIX  and  its  host  machine(s)  do  have  some  limitations. 
Some  of  the  notable  limitations  are: 

•  Program  Size.  MINIX  programs  are  limited  to  64K  bytes  if  compiled  as  non-separate 
text  and  data  and  128K  bytes  if  compiled  as  separate  text  and  data.  In  the  latter  case, 
the  text  and  data  regions  each  have  a  limitation  of  64K  bytes.    The  program  size 
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limitation  is  due  to  the  hardware  memory  management  unit  of  the  host  computer. 

•  Memory  Size.   The  main  memory  size  for  a  MTNTX  system  is  limited  to  640K  bytes. 
Again,  this  is  a  hardware  imposed  limitation. 

1.2.3.1  MTNTX  Problem  A  user  session  with  MINIX  occasionally  results  in  frustration 
because  of  the  main  memory  size  limitation.  Main  memory  use  on  an  IBM-PC  running 
MINIX  varies,  but  a  typical  distribution  is: 

MEMORY  LOCATION  RESIDENT  PROGRAMS/DATA 

0  -  120K  MINIX  Operating  System 

120K  -  360K  root  File  System  RAM  Disk 

360K  -  640K  User  Process  Area 

The  amount  of  main  memory  available  for  user  processes  is  about  280K  bytes.  This  allows 
about  four  64K  bytes  processes  to  run  concurrently.  Additional  processes  will  be  rejected 
due  to  lack  of  main  memory,  even  though  the  currently  running  processes  might  be  blocked 
and  ample  CPU  power  is  available.  On  a  multi-process  system  such  as  MINIX,  it  is  not 
uncommon  to  have  many  processes  active,  even  for  a  single  user.  For  example,  the  user 
might  type: 

$  Is  -al  |  grep  "[a-m]*.c"  |  more 

to  list  specific  files  in  the  current  directory  and  view  them  a  screenful  at  a  time.  The  count 
of  active  processes  is:    user  command  shell,   a  shell   spun  off  to  handle  the  multiple 
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commands  line,  Is  command,  grep  command,  and  the  more  command,  for  a  total  of  5 
processes.  In  this  example,  the  user  command  shell  is  blocked  until  all  commands 
complete;  it  can  be  swapped  out  if  necessary.  The  design  and  implementation  discussed  in 
this  paper  address  the  memory  limitation  problem. 
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CHAPTER  2 


2.  REQUIREMENTS 

This  chapter  presents  the  goals  and  objectives  that  the  design  and  implementation  must  meet 
to  provide  a  useful  enhancement  to  the  MIN1X  operating  system. 

2.1  Extend  Memory 

The  1.3  version  of  MINIX  provides  about  280K  bytes  of  main  memory  (about  520K  bytes 
on  an  AT  model)  for  user  programs.  Occasionally,  this  limit  is  reached  without  exceeding 
the  processing  capability  of  the  CPU.  This  results  in  a  refusal  to  perform  the  user  requested 
command  and  causes  user  disappointment  and  frustration.  The  system  should  provide  a 
means  to  overcome  the  main  memory  limitation  and  allow  full  use  of  the  CPU.  This  will 
allow  more  useful  work  to  be  done  and  will  promote  user  satisfaction. 

2.2  Maintain  Existing  Functions 

Operating  system  modifications  should  not  affect  the  current  functions  provided  by  MINIX. 
Additional  functions  are  permissible  only  if  they  do  not  affect  the  current  functions. 

2J   Maintain  MINIX  Structure 

As  described  in  Chapter  1,  MINIX  is  a  highly  structured,  modular  operating  system.  The 
design  should  maintain  that  structure  to  allow  future  enhancements  to  be  developed  with  the 
same  philosophy. 
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2.4  User  Administration 

The  installation  and  maintenance  of  the  modification  should  be  minimal  and  straightforward. 
Automated  installation  procedures  should  be  provided.  Maintenance  procedures  should  be 
clearly  documented. 

2.5  Performance 

The  implementation  should  incur  minimal  performance  degradation  and  main  memory  usage 
so  as  not  to  offset  the  performance  gain.  When  the  new  functionality  is  not  in  use,  the 
system  performance  should  equal  that  of  the  current  system.  When  in  light  use,  the  system 
performance  degradation  should  not  exceed  25%.  When  in  heavy  use,  the  CPU  limitations 
might  be  exceeded;  this  cannot  be  avoided. 
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CHAPTER  3 

3.  DETAILED  DESIGN 

3.1  Introduction 

This  chapter  presents  and  defends  a  design  that  meets  the  requirements  presented  in  Chapter 
2.  A  process  swapper  that  shuttles  process  images  between  main  memory  and  secondary 
memory  has  been  chosen  to  satisfy  the  requirements.  The  design  is  applied  to  the  1.3 
version  of  MINIX. 

3.2  Process  Swapping 

3 2.1  Purpose  of  Swapping  Process  memory  images2  must  reside  in  main  memory  to  run. 
If  main  memory  is  not  available,  a  process  cannot  be  created.  Process  swapping  allows 
more  processes  to  run  than  can  fit  into  the  available  main  memory.  It  does  this, 
transparently  to  the  user,  by  shuttling  process  images  between  main  memory  and  secondary 
memory  (called  the  swap  device).  Clearly,  a  process  on  secondary  memory  requires  longer 
access  time  than  a  process  in  main  memory,  so  low  priority,  inactive  processes  are  chosen 
for  temporary  swap  out.  When  the  swapped  out  processes  are  ready  to  run  and  main 
memory  is  available,  they  are  moved  back  into  main  memory. 


2.  A  process  memory  image  consists  of  the  data  and  instructions  that  must  reside  in  main  memory  for  the 
process  to  run.  It  can  include  the  process  text,  data,  stack,  and  the  process  table  slot.  Within  the  context  of 
this  document,  the  process  memory  image  consists  of  the  process  text,  data,  and  stack  regions  which  can  be 
swapped  out  and  in. 
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3.2.2  Swapping  Functions  The  functions  provided  by  swapping  are: 

•  provide  for  process  creation  when  main  memory  is  not  available  by  creating  the  new 
process  image  on  the  swap  device. 

•  swap  in  runnable  processes  and,  if  necessary,  free  main  memory  by  swapping  out 
inactive  processes. 

•  choose  appropriate  processes  for  swap  in/out. 

•  manage  space  on  the  swap  device. 

3.2.3  Alternatives  to  Swapping 

3.2.3.1  Increased  Memory  It  is  not  feasible  to  increase  the  amount  of  main  memory 
beyond  640K  bytes  on  an  IBM-PC.  The  IBM-AT  model  can  provide  up  to  384K  bytes  of 
extended  memory.  Currently,  MINIX  can  use  the  extended  memory  only  to  support  the 
root  file  system  RAM  disk.  This  frees  240K  bytes  of  main  memory  for  user  programs  and 
is  an  excellent  alternative  to  swapping  for  that  machine. 

3.2.3.2  Demand  Paging  Swapping  transfers  the  entire  process  image  between  main 
memory  and  the  swap  device.  An  alternative  to  swapping  is  demand  paging  which  transfers 
individual  memory  pages  instead  of  entire  processes  to  and  from  a  secondary  device. 
Demand  paging  permits  greater  flexibility  in  mapping  the  virtual  address  space  of  a  process 
into  the  physical  memory  of  a  machine,  usually  allowing  the  size  of  a  process  to  be  greater 
than  the  amount  of  available  physical  memory  and  allowing  more  processes  to  fit 
simultaneously  in  main  memory.   Unfortunately,  the  IBM-PC  does  not  provide  hardware  to 
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support  demand  paging. 


33  General  Design 


3.3.1  Introduction  The  swapping  design  affects  3  parts  of  the  MINIX  operating  system: 
the  kernel,  memory  manager  (MM),  and  the  file  system  (FS).  An  overview  of  MINK  with 
the  swapping  design  is  shown  in  Figure  3-1.  The  figure  shows  most  MINIX  components 
but  shows  the  communications  only  concerned  with  swapping. 

Recall  from  chapter  1  that  tasks  are  usually  interfaces  to  hardware  devices.  They  are  all 
linked  together  in  the  MINIX  kernel,  but  they  each  behave  as  a  separate  process.  This 
gives  them  independence  and  access  to  kernel  variables,  including  the  kernel  process  table. 
A  new  task,  the  swap  task,  was  added  to  the  kernel  to  manage  the  overall  swapping 
function.  Unlike  other  tasks,  it  performs  no  hardware  interfacing.  It  communicates  with 
other  tasks  and  the  memory  manager  (MM),  using  normal  MINIX  communications 
primitives,  to  manage  the  swapping  function. 

The  swap  task  relies  on  the  memory  management  functions  of  MM.  MM  is  responsible  for 
setup  and  completion  of  all  swap  ins  and  swap  outs.  MM  initiates  fork  and  exec  swaps 
and,  upon  command  from  the  swap  task,  performs  swap  ins  and  forced  swap  outs.  MM 
requests  physical  I/O  for  swapping  via  normal  user  system  calls  to  the  file  system  (FS). 

The  physical  swapping  input/output  and  swap  device  management  is  performed  by  the  file 
system  (FS).  The  swap  device  is  the  area  on  secondary  memory  used  to  store  process 
memory  images  that  have  been  swapped  out  of  main  memory. 
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LAYERS  1  &  2 
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(hard  disk  drive) 


LAYER  4 


Figure  3-1.  MIN1X  Process  Swapping  Overview 
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3.3.2  Overview  This  part  describes  a  typical  situation  that  involves  the  entire  swapping 
design  (Figure  3-1). 

Assume  that  a  user  process  issues  the  fork  system  call  to  create  a  new  process  that  is  a  copy 
of  itself  (i.e.,  create  a  child  process).  This  is  typical  in  multiprocessing  type  operating 
systems.  The  memory  manager  (MM)  receives  the  request  from  the  user  process  and  starts 
the  fork  function,  fork  must  allocate  main  memory  for  the  new  process  image.  If  main 
memory  is  not  available,  then  the  new  process  image  is  not  copied  to  main  memory  but  is 
copied  to  the  swap  device  instead.  This  is  done  within  MM  using  normal  user  system  calls 
to  FS.  The  calling  user  process  memory  image  is  written  to  a  specific  directory  within  a 
file  system  on  secondary  memory  (disk).  The  directory,  called  the  swap  device,  is  a 
repository  for  all  swapped  out  process  images.  When  that  is  complete,  MM  sends  a 
message  (via  the  system  task)  to  the  swap  task  to  indicate  that  a  runnable  process  is  on  the 
swap  device.  After  the  swap  out  MM  returns  the  process  id  of  the  child  process  to  the 
parent  process  just  as  in  a  normal  fork. 

The  swap  task  receives  the  message  and  chooses  the  most  eligible,  runnable  process  on  the 
swap  device  to  swap  in.  It  sends  a  swap  in  request  to  MM  and  awaits  the  results. 

If  main  memory  is  available,  MM  swaps  in  the  process  using  normal  user  system  calls  to 
FS  to  read  the  user  process  memory  image  from  the  directory  (swap  device)  and  copy  it  to 
main  memory.  When  the  swap  in  has  completed,  MM  notifies  the  swap  task  that  the 
process  image  has  been  swapped  in.  If  main  memory  is  not  available,  MM  notifies  the 
swap  task  of  the  failure.  MM  also  provides  information  to  the  swap  task  to  aid  in  choosing 
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a  process  to  swap  out. 

If  the  swap  in  succeeded,  the  swap  task  checks  for  more  runnable  processes  on  the  swap 
device  and  attempts  to  swap  them  in  in  the  same  manner.  If  the  swap  in  failed,  then  to  free 
main  memory  for  the  runnable  process  on  the  swap  device,  the  swap  task  chooses  an 
eligible  process  in  main  memory  to  swap  out.  It  uses  the  information  provided  by  MM  to 
aid  in  selection.  If  it  cannot  find  a  process  to  swap  out,  the  swap  task  idles  until  it  receives 
an  indication  that  main  memory  has  been  freed  or  that  a  process  in  main  memory  might 
now  be  eligible  to  swap  out  When  a  process  is  chosen  for  swap  out,  the  swap  task  sends  a 
swap  out  request  to  MM  and  awaits  the  results. 

MM  swaps  out  the  requested  process  image  using  normal  user  system  calls  to  FS  to  write 
the  user  process  memory  image  to  the  swap  device.  When  the  swap  out  has  completed, 
MM  notifies  the  swap  task. 

The  swap  task  then  chooses  the  most  eligible  runnable  process  on  the  swap  device  to  swap 
in.  As  previously  described,  it  requests  MM  to  swap  in  the  process. 

This  cycle  continues  until  there  are  no  runnable  processes  on  the  swap  device.  Processes 
chosen  by  the  swap  task  for  swap  out  can  be  runnable  or  blocked  on  a  system  call. 
Runnable  processes  are  eligible  for  swap  in  immediately.  Blocked  processes  are  not  eligible 
for  swap  in  until  the  system  call  has  been  completed.  When  a  blocked  process  on  the  swap 
device  becomes  runnable,  a  message  is  sent  to  the  swap  task  to  start  the  swap  in  procedure. 
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333  Swapping  Functions  From  the  Overview,  three  main  swap  functions  are  identified: 
managing  space  on  the  swap  device,  swapping  processes  out  of  main  memory,  and 
swapping  processes  into  main  memory. 

3J 3.1  Swap  Device  Swapping  introduces  another  resource  that  the  operating  system  must 
manage,  the  swap  device.  The  swap  device  is  the  area  on  secondary  memory  used  to  store 
process  memory  images  that  have  been  swapped  out  of  main  memory.  The  swap  device 
can  be  considered  an  extension  of  main  memory  with  a  large  size  and  limited  capability. 
Its  capability  is  limited  to  storing  process  memory  images  because  processes  cannot  use  the 
CPU  while  on  the  swap  device.  Its  advantages  are  that  a  process  can  be  kept  alive  while  on 
the  swap  device,  a  system  call  placed  before  the  process  was  swapped  out  can  progress 
(because  it  is  performed  by  another  process:  MM  or  FS),  and  the  main  memory  that  the 
process  is  not  using  can  be  used  by  another  running  process. 

333J2  Swapping  Out  Processes  Figure  3-2  shows  the  original  MTNIX  user  process 
states.  Processes  created  by  fork  and  overlaying  programs  introduced  by  exec  ("new  proc") 
are  placed  in  main  memory  and  then  the  process  is  placed  on  the  RUN  queue  ("on  RUN 
queue").  However,  if  main  memory  is  not  available,  then  the  process  creation  fails.  With 
the  introduction  of  swapping,  these  failures  become  candidates  for  swap  out.  Figure  3-3 
shows  a  new  state  ("SWAPPED  &  RUNNABLE")  to  which  these  failures  go  when  main 
memory  is  not  available.  In  this  state  the  process  memory  images  are  on  the  swap  device. 
Later,  when  main  memory  is  made  available,  they  are  swapped  in  (see  "Swapping  In 
Processes"). 
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Figure  3-2.  Original  MINIX  User  Process  States 
It  is  necessary,  at  tunes,  to  force  a  process  out  of  main  memory  to  free  space  for  a  runnable 

process  currently  on  the  swap  device.   Processes  must  be  chosen  carefully  for  forced  swap 

out.  Recall  that  when  a  user  process  performs  a  system  call  it  is  blocked  until  the  system 

call  is  completed  by  MM  or  FS.    Most  system  calls  result  in  data  being  transferred  to  or 

from  the  user  process  data  region  in  main  memory.   MM  and  FS  rely  on  the  fact  that  a 
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Figure  3-3.  MTNIX  User  Process  States  with  Swapping 
process  will  always  be  in  main  memory.   Therefore,  a  process  that  is  chosen  for  swap  out 


should  not  be  waiting  for  I/O.  This  affects  which  processes  are  eligible  to  swap  out. 
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For  example,  suppose  that  a  user  process  wishes  to  write  some  data  to  an  I/O  device  (state 
"RUNNING"  in  Figure  3-2).  It  issues  a  write  system  call  to  the  operating  system.  This  is 
done  by  SENDing  a  message  to  FS  and  awaiting  the  result  by  issuing  a  RECEIVE  from  FS. 
The  user  process  is  therefore  blocked  on  SEND  and  RECEIVE  ("blocked  on  SEND  & 
RCV").  When  FS  issues  a  RECEIVE  (to  accept  new  work  from  ANY  source),  the  message 
passing  mechanism  of  the  operating  system  passes  the  system  call  write  request  to  FS.  The 
user  process  is  now  freed  from  the  SEND  block  and  is  blocked  only  on  RECEIVE  from  FS 
("blocked  on  RCV").  At  this  point  FS  is  dedicated  to  processing  the  user's  write  request. 
There  is  no  way  to  stop  it.  Data  will  be  read  from  the  user's  area  and  copied  to  the  I/O 
device.  If  the  user  process  is  swapped  out  BEFORE  the  data  has  been  retrieved  from  its 
data  region,  FS  must  wait  until  the  user  process  is  swapped  back  into  core.  Currently,  FS  is 
not  designed  to  wait  for  an  indeterminate  amount  of  time  for  that  to  occur.  Similar 
problems  exist  with  other  system  calls. 

In  UNIX,  the  problem  is  simplified  because  the  user  process  itself  traps  to  kernel  mode  to 
perform  operating  system  functions.  When  the  user  process  is  in  core  and  in  the  kernel 
mode,  just  before  it  does  an  I/O  operation,  it  marks  itself  as  being  ineligible  for  a  swap  out. 
It  stays  marked  until  the  I/O  has  completed.  When  the  user  process  is  swapped  out,  no 
progress  can  be  made  on  the  system  call  because  the  user  process  itself  performs  the 
protected  kernel  procedures  in  kernel  mode.  Therefore,  an  I/O  operation  cannot  begin  while 
the  process  is  swapped  out 

Clearly,  the  UNIX  solution  cannot  be  used.    In  MINIX,  one  solution  is  to  mark  all  user 
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processes  that  do  system  calls  requiring  their  data  region  to  be  read  or  written  as  ineligible 
for  swap  out.  This  requires  marking  nearly  all  system  calls  because  the  message  passing 
scheme  uses  memory  pointers  to  the  user  area  to  pass  system  call  parameters  and  to  return 
data  and  responses.  It  would  not  make  a  significant  difference  to  single  out  the  few  that  do 
not  access  the  user  process  data  region. 

Another  solution  is  to  buffer  I/O  requests  for  swapped  out  processes  so  that  I/O  is  delayed 
until  the  process  is  swapped  in.  This  is  a  complex  modification  to  MINIX  and  the 
buffering  required  would  increase  the  operating  system  size. 

This  design  chooses,  for  forced  swap  out,  those  processes  that  are  blocked  for  an  indefinite 
amount  of  time  and  are  expecting  a  minimal  amount  of  I/O.  Conveniently,  MINIX  clearly 
identifies  such  processes  in  the  MM  process  table.  They  are  processes  blocked  on  the 
system  calls  pause  and  wait.  They  are  ineligible  to  run  until  the  system  call  completes  so 
they  are  excellent  candidates  for  swap  out.  This  is  illustrated  in  the  user  process  state 
diagram  Figure  3-3  by  the  new  state  ("SWAPPED  &  BLOCKED  ON  RCV").  These  two 
system  calls  generate  minimal  I/O  that  is  easily  buffered  by  the  kernel  until  the  process  is 
swapped  in.  When  the  system  call  completes,  the  ("SWAPPED  &  RUNNABLE")  state  is 
entered  and  the  process  is  ready  for  swap  in. 

A  secondary  choice  for  processes  to  swap  out,  is  processes  that  are  not  blocked  at  all.  They 
are  expecting  no  I/O  because  they  are  currently  CPU  bound.  It  is  an  easy  matter  to  swap 
them  out  ("SWAPPED  &  RUNNABLE").  They  are  not  an  ideal  choice  for  swap  out 
because  they  are  runnable,  but  they  should  rarely  get  chosen.   In  situations  where  they  do 
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get  swapped  out  (no  blocked  processes  remaining  in  core),  they  provide  a  method  for 
sharing  the  CPU  fairly  among  all  runnable  processes. 

A  third  possibility  for  forced  swap  out  is  processes  blocked  on  both  SEND  and  RECEIVE. 
This  has  been  considered  and  it  is  feasible,  but  because  of  the  added  complexity,  it  was  not 
implemented. 

Another  problem  with  swapping  out  processes  concerns  the  location  in  main  memory  of  the 
swapped  out  process  and  how  it  relates  to  the  process  to  be  swapped  in.  MINIX  uses  the 
first  fit  algorithm  when  allocating  main  memory  for  new  processes.  Suppose  that  a  number 
of  processes  reside  in  main  memory  with  a  total  free  space  of  20K  bytes.  A  process  on  the 
swap  device  with  a  size  of  18K  bytes  is  runnable  and  should  be  swapped  in.  However,  the 
free  main  memory  is  fragmented  and  the  largest  amount  of  contiguous  free  main  memory  is 
10K  bytes  (Figure  3-4a).  Should  this  require  a  swap  out  of  a  program  or  group  of  programs 
whose  size  is  >=  18K  bytes,  or  a  swap  out  of  a  program(s)  next  to  a  free  block(s)  such  that 
program(s)  size  +  free  block(s)  size  >=  18K  bytes?  This  idea  is  suggested  in  references  [4] 
and  [5].  One  advantage  is  that  no  processes  are  swapped  out  if  a  large  enough  cluster 
cannot  be  found.  This  reduces  the  number  of  unnecessary  swap  outs.  (The  standard  UNIX 
swapper  swaps  out  processes  even  though  doing  so  might  not  result  in  a  large  enough  area 
for  the  incoming  process.  So  unnecessary  swapping  occurs.) 

Including  this  type  of  requirement  in  the  algorithm  for  choosing  a  process  for  swap  out 
makes  it  more  complex  and  results  in  a  smaller  set  of  processes  from  which  to  choose. 
Recall  that  only  a  special  subset  of  all  user  processes  is  eligible  for  swap  out  (runnable  or 
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blocked  on  pause  and  wait).  An  alternative  is  to  compact  memory  before  the  swap  in  such 
that  a  contiguous  block  of  free  memory  of  size  20K  bytes  resides  in  high  memory  (Figure 
3-4b).  The  process  can  then  be  swapped  into  the  free  memory  (using  first  fit)  and  a 
potential  swap  out  is  eliminated  (Figure  3-4c).  The  performance  cost  of  memory 
compaction  on  the  IBM-PC  is  certainly  less  than  the  cost  of  a  swap  out  to  secondary 
memory.  In  addition,  compaction  simplifies  the  algorithm  to  choose  a  process  to  swap  out. 
For  example,  if  an  18K  bytes  runnable  process  is  on  the  swap  device  and  total  free  main 
memory  is  10K  bytes  and  is  fragmented,  then  all  that  is  required  to  provide  space  for  the 
swap  in  is  to  find  an  eligible  incore  program  >=  8K  bytes,  swap  it  out,  and,  if  necessary, 
compress  memory.  The  process  can  then  be  swapped  into  the  free  memory.  The  overhead 
of  a  compaction  is  incurred  but  is  justified  if  the  following  is  considered.  The  amount  of 
main  memory  available  for  MINIX  user  processes  is  small  (about  280K  bytes)  and  the 
actual  number  of  processes  is  small  (process  table  can  accommodate  15  user  processes 
maximum).  Because  there  is  not  a  lot  of  main  memory  to  compact,  compaction  is  quick 
and  because  there  are  not  a  lot  of  processes  from  which  to  choose,  the  compression 
algorithm  will  find  an  eligible  process  for  swap  out  much  more  often  than  the  other 
algorithm.  In  addition,  the  compression  algorithm  can  choose  more  appropriate  processes 
for  swap  out. 

3.3.3.3  Swapping  In  Processes  It  now  becomes  an  easy  matter  to  determine  which 
processes  are  candidates  for  swap  in:  processes  in  the  process  state  ("SWAPPED  & 
RUNNABLE").  A  priority,  based  on  residence  time,  is  placed  on  the  processes  in  that  state 
to  facilitate  the  choice. 
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Figure  3-4.  Illustration  of  Compaction 
3.4  Detailed  Module  Description 
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3.4.1  Introduction  The  3  swapping  functions  just  described  are  provided  by  various  parts 
of  the  operating  system:  kernel  provides  management  and  decision  making,  MM  provides 
coordination,  and  FS  provides  primitive  swap  out/in  operations.  The  design  changes  to  the 
kernel,  MM,  and  FS  to  provide  swapping  are  described  here. 

3.4.2  Kernel  -  Management  and  Decisions 

3.4.2.1  Swap  Task  The  swap  task  makes  all  swapping  decisions.  The  swap  task  was 
implemented  as  a  kernel  task  for  the  following  reasons: 

•  it  can  be  closely  tied  to  process  scheduling 

•  it  needs  timely  access  to  some  kernel  variables. 

The  swap  task,  normally  idle,  awaits  messages  concerning  system  swap  status  and  it 
responds  appropriately.  Two  variables  define  the  state  of  the  swap  task:  swapstate  and 
inprogress.  Swapstate  can  have  the  following  values: 

-  IDLE  -  no  runnable  processes  on  the  swap  device 

•  SWAJMN  -  runnable  process  on  the  swap  device. 

A  runnable  process  is  a  process  that  is  not  blocked  on  SEND  and/or  RECEIVE.  Inprogress 
has  the  following  values: 

■  NONE  -  no  swap  operation  in  progress 

•  SWAPIN  -  a  swap  in  is  in  progress 
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SWAPOUT  -  a  swap  out  is  in  progress. 


A  complete  swap  task  state  diagram  is  shown  in  Figure  3-5. 
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IS  FREE 
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SWAP  OUT  FAIL 


CORE  IS  NEEDED 


Figure  3-5.  State  Diagram  for  Swap  Task 
The  swap  task  is  described  in  terms  of  its  state  and  transitions. 
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3.4.2.1.1  Swap  Task  States  State  1  is  entered  at  system  startup.  There  are  no  runnable 
processes  on  the  swap  device  and  the  swap  task  idles  whenever  in  this  state. 

State  2  is  entered  when  a  runnable  process  is  on  the  swap  device  and  an  attempt  is  being 
made  by  the  swap  task  to  swap  it  in. 

State  3  is  entered  when  a  runnable  process  is  on  the  swap  device,  main  memory  is  not 
available,  and  a  swap  out  is  being  attempted  by  the  swap  task  to  free  main  memory. 

State  4  is  entered  when  a  runnable  process  is  on  the  swap  device,  main  memory  is  not 
available,  and  no  in  core  processes  are  eligible  for  swap  out.  The  swap  task  idles  until 
memory  is  freed  or  a  process  becomes  eligible  for  swap  out. 

3.4.2.1.2  Swap  Task  Transitions  The  swap  task  can  receive  six  different  messages  from 
MM  and  other  tasks.  The  following  describes  how  each  of  the  messages  (or  events)  affects 
the  swap  task  states  and  how  the  swap  task  responds. 

COREISNEEDED:  This  message  is  received  when  a  runnable  process  is  on  the  swap 
device.  It  is  the  swap  task's  responsibility  to  move  it  into  main  memory  as  quickly  as 
possible.  This  message  is  sent  from: 

—  MM  when  a  process,  on  the  swap  device,  that  was  blocked  on  the  pause  or  wait  system 
call  has  become  unblocked. 

—  MM  (via  the  system  task)  when  zfork  or  exec  swap  has  occurred. 

If  the  swap  task  is  in  state  1,  the  swap  task's  response  is  to  change  the  swap  status  from 
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SWAPIDLE  to  SWAPIN  (state  1  to  state  2)  and  try  to  swap  in  the  ninnable  process.  All 
runnable  processes  on  the  swap  device  will  be  evaluated  and  the  most  eligible  will  be 
chosen  (see  "Swap  In  Algorithm"  below).  The  swap  task  then  sends  a 
SWAP  INREQUEST  message  to  MM  to  swap  in  the  chosen  process.  MM  will  attempt  to 
swap  in  the  process  and  will  return  either  a  SWAPINCOMPLETE  or  SWAPIN  FAIL 
message  to  the  swap  task.  The  swap  task  can  have,  at  most,  one  swap  in  or  one  swap  out 
operation  in  progress. 

SWAPINCOMPLETE:  This  message  is  received  when  a  swap  in,  requested  by  the  swap 
task  (from  state  2),  has  successfully  completed.  It  is  sent  by  MM  after  swap  in  completion 
and  should  be  received  only  while  in  state  2.  The  swap  task  responds  by  readying  the 
process  for  execution.  Then  the  swap  task  checks  for  more  runnable  processes  on  the  swap 
device  (and  stays  in  state  2  or  goes  to  state  1). 

SWAPJNFAIL:  This  message  is  received  when  a  swap  in,  requested  by  the  swap  task 
(from  state  2),  has  failed.  It  is  sent  by  MM  after  attempting  a  swap  in.  The  reason  for 
failure  is  assumed  to  be  not  enough  main  memory  for  the  process.  MM  also  sends 
additional  information  to  help  the  swap  task  choose  a  process  for  swap  out:  a  list  of  all 
processes  that  are  blocked  on  the  pause  and  wait  system  calls  and  the  amount  of  main 
memory  required  to  swap  in  the  requested  process.  Based  on  the  information  returned  by 
MM,  the  swap  task  responds  by  choosing  a  process  to  swap  out  so  that  main  memory  can 
be  freed  to  allow  the  runnable  process  into  main  memory.  It  then  marks  the  chosen  process 
as  SWAPPED  and  sends  a  SWAPOUTREQUEST  to  MM  (state  3).   If  an  eligible  process 
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for  swap  out  cannot  be  found  (state  4),  then  the  swap  task  sends  a  NOSWAPOUT 
message  to  the  waiting  MM  and  idles  until  a  COREISFREE  message  is  received. 

CORE_IS_FREE:  This  message  is  received  when  a  parameter  that  controls  swap  out  has 
changed.  If  the  swap  task  is  in  state  4,  it  can  re-evaluate  all  in  core  processes  to  find  an 
eligible  swap  out  candidate.  Changeable  swapping  parameters  are:  amount  of  main  memory 
available  and  process  residence  time.  The  other  parameter  is  process  memory  image  size. 
This  message  is  sent  from: 

—  MM  (via  the  system  task)  when  a  process  has  exited  meaning  that  main  memory  has 
become  available. 

—  MM  when  a  process  has  blocked  itself  (i.e.,  it  has  called  the  pause  or  wait  system  call), 
meaning  that  main  memory  can  be  freed  by  swapping  out  the  blocked  process. 

—  Clock  task  after  the  process  residence  times  have  increased.  This  means  that  a  process 
might  now  be  resident  long  enough  to  be  eligible  for  swap  out  to  free  main  memory. 

The  swap  task  (state  4)  responds  by  evaluating  all  runnable  processes  on  the  swap  device 
and  trying  to  swap  in  the  most  eligible,  hoping  that  enough  core  is  available  (state  2).  If 
the  swap  in  fails,  then  the  swap  task  will  evaluate  all  processes  in  main  memory  for  swap 
out  (and  go  to  state  3  or  4). 

SWAPOUTCOMPLETE:  This  message  is  received  when  a  swap  out,  requested  by  the 
swap  task  (from  state  3),  has  successfully  completed.  It  is  sent  by  MM  after  swap  out 
completion.  The  fork  and  exec  swaps  are  reported  by  the  COREJS  NEEDED  message.  A 
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forced  swap  out  is  perfomied  only  when  main  memory  space  is  needed  for  an  incoming 
process.  So  the  reception  of  this  message  means  that  there  must  be  a  runnable  process  on 
the  swap  device.  The  swap  task  responds  by  marking  the  swapped  out  process  as 
NOMAP,  setting  the  residence  rime  to  0,  and  marking  the  process  as  BLOCKED  if  it  is 
blocked.  Then  the  swap  task  goes  to  state  2  and  tries  to  swap  in  the  most  eligible  process 
(most  likely,  the  process  that  caused  the  forced  swap  out). 

SWAPOUTFAILED:  This  message  is  received  when  a  swap  out,  requested  by  the  swap 
task  (from  state  3),  has  failed.  It  is  sent  by  MM  to  indicate  the  failure  which  is  assumed  to 
be  caused  by  lack  of  space  on  the  swap  device.  The  swap  task  responds  by  marking  the 
process  as  not  SWAPPED  and  then  waiting  for  the  next  COREISFREE  message  (state  4). 

3.4.2.U  Swap  Out  Algorithm  Processes  for  swap  out  are  chosen  by  the  swap  task  based 
on  the  following  attributes: 

•  Size.  The  process  to  be  swapped  out  should  be  equal  to  or  larger  than  the  amount  of 
main  memory  needed  for  the  swap  in  less  the  current  amount  of  free  main  memory. 
This  reduces  the  number  of  swap  outs  that  would  occur  if  size  were  not  a  factor.  (Size 
is  not  considered  in  the  standard  UNIX  swap  implementations.  Studies  (see  reference 
[6])  have  shown  that  size  considerations  can  reduce  the  number  of  swaps.)  When  MM 
returns  a  SWAPINFAIL  message,  it  also  sends  the  size  of  additional  main  memory 
needed  for  the  swap  in.  The  size  calculation  by  MM  includes  the  total  amount  of 
existing  free  memory. 
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•  Process  State.  The  process  to  be  swapped  out  must  be  blocked  on  pause  or  wait  or  be 
runnable.  all  processes  currently  blocked  on  pause  and  wait  are  marked  in  the  MM 
process  table.  When  MM  returns  a  SWAPINFAEL  message,  it  also  sends  a  list  of  all 
processes  currently  blocked  on  pause  and  wait.  These  processes  have  a  higher  priority 
for  swap  out  than  runnable  processes. 

■  Time  in  Core.  Thrashing  is  a  condition  in  which  one  or  more  process  images  are 
continually  swapped  in  and  out.  System  resources  are  expended  performing  the 
swapping  rather  than  performing  useful  work.  Thrashing  can  occur  in  this  design  when 
runnable  processes  are  swapped  out  because  a  runnable  process  on  the  swap  device  is 
immediately  available  for  swap  in.  If  another  runnable  process  is  chosen  for  swap  out, 
then  thrashing  can  occur.  To  prevent  thrashing,  a  condition  for  swap  out  is  that  a 
process  must  reside  in  main  memory  for  a  minimal  amount  of  time.  Within  that 
minimal  amount  of  time,  the  process  should  get  access  to  the  CPU  and  progress.  A 
potential  swap  out  is  delayed  due  to  lack  of  eligible  swap  out  candidates  so  a  runnable 
process  might  spend  a  bit  longer  on  the  swap  device.  However,  system  resources  are 
expended  doing  work  more  useful  than  swapping  and  all  users  experience  better 
performance. 
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The  following  table  shows  the  swap  out  priority  highest  to  lowest: 

SIZE  >=  NEEDED  BLOCK  TYPE  CORE  RESIDENCE 

Y  pause/wait  oldest  in  this  category 

Y  none  oldest  above  minimal 
N  pause/wait  oldest  in  this  category 
N  none  oldest  above  minimal 

3.4.2.1.4  Swap  In  Algorithm  Only  runnable  processes  are  chosen  for  swap  in.  The 
runnable  process  that  has  been  on  the  swap  device  the  longest  is  chosen  for  swap  in.  No 
minimum  amount  of  time  on  the  swap  device  is  required  because  thrashing  is  minimized  by 
the  swap  out  algorithm. 

3.4.2.2  Clock  Task  The  clock  task  has  2  swapping  related  duties: 

•  Every  second,  the  clock  task  increments  the  residence  time  of  each  user  process.  The 
residence  time  is  the  amount  of  time  that  the  process  has  recently  been  in  its  current 
residence  (main  memory  or  on  the  swap  device).  It  is  set  to  zero  whenever  a  process  is: 

—  created, 

—  moved  into  main  memory, 

—  and  moved  onto  the  swap  device. 

Residence  time  is  used  as  a  parameter  in  determining  the  eligibility  of  a  process  for 
swap  out   Thrashing  is  minimized  by  requiring  that  a  process  remain  in  main  memory 
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for  a  minimum  amount  of  time  before  being  swapped.  Residence  time  is  also  an 
attempt,  although  weak,  to  allow  equal  access  to  the  CPU  for  all  processes.  This  is 
especially  true  for  runnable  processes  that  are  chosen  for  swap  out. 

•  The  clock  task  checks  the  status  of  the  swap  task  every  second.  If  a  runnable  process  is 
on  the  swap  device  and  the  swap  task  is  not  in  a  swap  in  state,  the  clock  task  will  send 
a  COREISFREE  message  to  the  swap  task.  This  means  that  residence  times  have 
changed  and  that  it  might  now  be  possible  to  find  an  eligible  process  for  swap  out.  The 
swap  task  can  then  re-evaluate  all  processes. 

3.4.2.3  System  Task  MM  and  FS  do  not  have  direct  access  to  the  kernel  variables  and 
kernel  process  table.  The  system  task  is  their  interface  to  the  kernel  to  get  data,  set  kernel 
variables,  and  perform  miscellaneous  duties.  Some  of  the  existing  communications  between 
the  system  task  and  FS  and  MM  are  used  as  vehicles  for  communicating  swapping  related 
data.  The  swapping  related  duties  performed  by  the  system  task  are: 

•  When  MM  performs  a /or*,  it  notifies  the  kernel  to  set  up  the  kernel  process  table.  The 
swapping  design  takes  advantage  of  the  existing  fork  communication  when  a  fork  swap 
occurs.  If  the  fork  message  from  MM  indicates  that  a  fork  swap  has  occurred,  the 
system  task  marks  the  kernel  process  table  slot  as  SWAPPED,  and,  if  the  swap  task  is 
SWAPIDLE,  sends  a  CORE_IS_NEEDED  message  to  the  swap  task. 

•  When  MM  performs  an  exec,  it  notifies  the  kernel  to  set  up  the  kernel  process  table.  If 
the  exec  message  from  MM  indicates  that  an  exec  swap  has  occurred,  the  system  task 
marks  the  kernel  process  table  slot  as  SWAPPED.   If  the  swap  task  is  SWAPIDLE,  the 
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system  task  sends  a  COREISNEEDED  message  to  the  swap  task  to  indicate  that  a 
ninnable  process  exists  on  the  swap  device.  If  the  exec  message  indicates  that  a  normal 
exec  has  occurred  and  the  swap  task  status  is  SWAPIN,  a  COREJSFREE  message  is 
sent  to  the  swap  task.  This  means  that  an  exec  has  occurred  that  might  have  freed  some 
main  memory. 

•  When  a  process  exits,  MM  notifies  the  kernel  via  the  system  task  to  clean  up  the 
process's  process  table  slot  and  free  it  for  reuse.  If  the  swap  task  status  is  SWAPIN,  a 
COREISFREE  message  is  sent  to  the  swap  task  to  indicate  that  main  memory  has 
been  freed. 

3.4.3  Memory  Manager  -  Coordination 

3.4.3.1  Fork  The  fork  system  call  creates  a  new  process  by  copying  the  current  process's 
text,  data,  stack,  and  process  table  slot.  The  parent  and  child  receive  a  different  return  value 
to  enable  them  to  identify  themselves.  When  main  memory  is  allocated  for  the  child 
process,  the  allocation  routine  determines  whether  enough  memory  is  available.  If  enough 
contiguous  free  memory  is  available,  then  the  fork  can  continue.  If  free  memory  is 
available,  but  is  not  contiguous,  the  allocation  routine  compacts  main  memory  (i.e., 
relocates  the  process  images  in  main  memory  to  create  a  large  contiguous  free  memory 
space  in  high  memory).  Compaction  was  implemented  to  serve  two  purposes: 

•  reduce  the  number  of  swap  outs 

•  simplify  the  swap  in/out  algorithms. 
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In  either  case,  the  memory  required  for  the  child  process  is  allocated.  However,  if  the 
required  main  memory  is  not  available,  the  fork  swap  out  is  begun.  It  calls  the  swapout 
procedure  (see  "Swap  Out  Function")  to  copy  the  parent's  process  image  to  the  swap 
device.  It  marks  the  MM  process  table  as  FKSWAPPED  (fork  swapped),  notifies  the  kernel 
that  a  fork  swap  has  occurred  (see  "System  Task"),  and  notifies  the  kernel  of  the  child's 
memory  map  for  swap  in. 

3.4.3.2  Exec  The  MM  exec  system  call  overlays  the  existing  program  with  a  new 
program.  (The  program  changes  while  the  process  remains.)  The  new  program  inherits  the 
environment  of  the  calling  program.  When  main  memory  is  allocated  for  the  new  program, 
the  allocation  routine  determines  whether  enough  memory  is  available.  The  memory  space 
of  the  current  process  is  included  in  the  free  memory  evaluation.  If  enough  memory  is 
available,  then  the  exec  can  continue.  If  enough  memory  is  available,  but  is  not  contiguous, 
then  the  memory  of  the  existing  process  is  freed,  main  memory  is  compacted,  and  the 
memory  required  for  the  new  program  is  allocated. 

If  the  required  main  memory  is  not  available,  the  exec  swap  out  is  begun.  The  new 
program  text  and  data  are  read  from  the  executable  file  on  disk  and  written  to  the  swap 
device.  The  stack  is  adjusted  and  also  written  to  the  swap  file.  The  existing  memory  is 
freed,  the  MM  process  slot  is  marked  as  SWAPPED,  and  the  kernel  is  notified  that  an  exec 
swap  has  occurred  (see  "System  Task"). 
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3.4.3.3  Paused  Process  Handling  There  are  two  cases  to  consider: 

•  when  any  process  issues  the  pause  system  call 

•  and  when  a  pause  swapped  process  becomes  unpaused. 

When  a  process  issues  the  pause  system  call,  it  is  blocked  until  the  system  call  completes 
and  it  expects  no  I/O.  This  makes  it  an  excellent  candidate  to  swap  out  when  space  is 
needed  in  main  memory.  If,  at  the  same  time,  a  runnable  process  is  on  the  swap  device 
waiting  to  be  swapped  in,  a  COREISFREE  message  is  sent  to  the  swap  task.  This  means 
that  a  process  is  available  for  swap  out. 

A  process  leaves  the  pause  state  upon  reception  of  a  signal.  If  the  process  is  swapped  out, 
and  there  are  no  other  runnable  processes  on  the  swap  device,  a  COREISNEEDED 
message  is  sent  to  the  swap  task.  The  swap  task  marks  the  kernel  process  table  as  not 
BLOCKED.  After  the  process  is  swapped  in,  the  expected  EINTR  result  value  is  returned 
to  the  calling  process. 

3.4.3.4  Wait  Process  Handling  There  are  three  cases  to  consider: 

•  when  a  process  issues  the  wait  system  call 

•  when  a  waiting,  swapped  process  is  unblocked  by  a  signal 

•  when  a  waiting,  swapped  process  receives  a  death  of  child  signal. 

The  first  two  cases  are  the  same  as  those  described  for  pause  process  handling. 
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The  death  of  child  requires  a  little  more  handling.  Death  of  child  requires  that  the  waiting 
process  receive  the  exit  status  of  the  dead  child  process.  This  is  accommodated  for 
swapped  processes  by  postponing  the  process  slot  cleanup  of  the  dead  process,  marking  it  as 
HANGING  (i.e.,  a  zombie),  freeing  the  dead  child's  memory,  and  saving  the  dead  child's 
process  slot  number  in  the  parent's  process  table  slot.  When  the  process  is  swapped  in,  the 
dead  child's  exit  status  is  returned  to  the  parent  and  the  process  slot  cleanup  is  performed. 

3.43.5  Signal  Handling  If  a  signal  is  sent  to  a  swapped  out  process  it  is  saved  until  the 
process  has  been  swapped  in.  If  the  process  is  BLOCKED,  it  is  marked  not  BLOCKED  in 
the  kernel  process  table  and  a  COREISNEEDED  message  is  sent  to  the  swap  task  which 
will  eventually  start  the  swap  in  for  this  process.  All  signals  received  are  stored  in  the 
kernel  process  table.  When  the  process  is  swapped  in  and  before  it  is  marked  ready  to  run, 
all  pending  signals  are  applied.  Applicable  signal  actions  are: 

•  terminate  the  process  -  the  user  process  is  terminated, 

•  ignore  the  signal  -  the  signal  is  ignored, 

•  and  perform  a  function  -  the  user  process  stack  is  adjusted  to  set  up  a  call  to  the  signal 
handling  function.  When  the  process  runs  the  signal  handling  function  will  be  the  first 
thing  performed. 

3.43.6  Freed  Memory  Handling  Main  core  is  freed  by  the  exit  system  call  and 
sometimes  by  the  exec  system  call  (i.e.,  execing  a  program  with  a  smaller  process  memory 
image).  The  system  task  sysxit  and  sysexec  functions  determine  if  a  runnable  process  is 
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on  the  swap  device  when  either  of  the  above  occurs.  If  so,  then  they  send  a 
COREISFREE  message  to  the  swap  task. 

3.4.3.7  Swap  Out  Function  Preparation  for  physical  swapping  of  a  process  image  to  the 
swap  device  is  performed  by  MM.  This  occurs  two  ways,  fork  swap  and  forced  swap  out. 
(exec  swaps  are  a  little  different  because  the  process  image  is  in  the  executable  file,  not  in 
main  memory.)  The  swap  task  sends  a  SWAPOUTREQUEST  message  to  the 
doswapoutO  function.  The  process  is  marked  as  SWAPPED.  The  location  and  size  of 
the  process  segments  are  determined.  A  file  name  for  the  process  is  generated.  The  file  is 
created  and  the  process  segments  are  written  to  the  swap  device.  If  the  request  is  from  the 
swap  task,  main  memory  is  freed  and  a  SWAPOUTCOMPLETE  message  is  sent  to  the 
swap  task. 

3.4.3.8  Swap  In  Function  Preparation  for  physical  swapping  of  a  process  image  from  the 
swap  device  is  performed  by  MM.  The  swap  task  sends  a  SWAPINREQUEST  message 
to  the  doswapinO  function  which  determines  if  there  is  enough  main  memory  for  the 
incoming  process.  If  main  memory  is  not  available,  then  the  function  makes  a  list  of  all 
processes  that  are  blocked  on  the  pause  and  wait  system  calls.  It  sends  this  list  along  with 
the  size  of  main  memory  needed  and  the  SWAPJNFAILED  message  to  the  swap  task. 
This  aids  the  swap  task  in  choosing  a  process  for  swap  out. 

If  there  is  enough  free  main  memory,  then  the  swap  in  proceeds.  If  enough  free  memory 
exists  but  is  not  contiguous,  then  the  allocation  routine  compacts  memory,  as  described  for 
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fork  and  exec,  and  the  swap  in  proceeds.  The  memory  location  and  size  of  each  process 
segment  are  determined.  The  file  name  for  the  process  is  re-generated.  The  file  is  opened 
and  the  process  segments  are  read  from  the  swap  device  and  copied  to  main  memory.  The 
process  is  marked  as  not  SWAPPED  and  the  swap  file  is  removed.  If  the  process  was 
swapped  out  as  a  result  of  a  fork,  then  a  wakeup  message  is  sent  to  the  process  just  as  in  a 
normal  fork.  All  signals  that  were  received  by  the  process  while  swapped  out  are  applied. 
If  the  process  was  pause  or  wait  and  was  interrupted  by  a  signal,  then  the  EINTR  result  is 
sent  to  the  process.  If  the  process  was  wait  and  was  awakened  by  the  death  of  a  child,  then 
the  child's  exit  status  is  sent  to  the  process  and  the  dead  process's  process  table  slot  is 
cleaned  up.  The  SWAPINCOMPLETE  message  is  then  sent  to  the  swap  task. 

3.4.4  File  System  -  Swap  I/O 

3.4.5  Swap  Device  The  swap  device  is  the  area  on  secondary  memory  that  is  used  to  store 
process  images  that  have  been  swapped  out  of  main  memory.  Swap  device  input/output 
operations  should  be  efficient  to  minimize  system  performance  degradation.  Typically, 
space  on  a  swap  device  is  allocated  in  multiblock  segments  so  that  the  swapped  process 
images  are  written  and  read  contiguously.  Contiguous,  multiblock  reads  and  writes  are 
quicker  than  normal  file  system  block-by-block  operations.  When  a  process  image  is 
swapped  into  main  memory,  its  storage  space  on  the  swap  device  is  freed.  Space 
management  routines  should  be  used  to  manage  fragmentation  that  occurs  on  swap  devices. 
Because  the  allocation  scheme  for  the  swap  device  differs  from  the  normal  file  allocation 
scheme,  data  structures  that  catalog  free  space  differ  too. 
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The  above  requirements  should  ensure  good  swap  device  input/output  performance  but  they 
also  exact  a  price:  additional  code  and  complexity  to  perform  and  maintain  that  function. 
On  a  system,  such  as  MINIX,  where  the  main  memory  for  user  processes  is  limited, 
additional  code  in  the  operating  system  will  further  limit  that  memory. 

This  design  uses  an  approach  which  sacrifices  efficiency  for  simplicity.  The  existing  file 
system  is  used  to  manage  the  swapping  input/output  operations.  The  swap  device, 
therefore,  consists  of  a  normal  file  system  directory  that  is  protected  (with  normal  file 
system  protection  mechanisms)  from  all  users  except  root.  The  file  system  handles  all 
swapping  input/output  as  it  handles  all  other  file  system  input/output.  No  special 
contiguous,  multiblock  input/output  operations  or  swap  device  space  management 
procedures  are  required.  Normal  creat,  open,  read,  write,  seek,  close,  and  unlink  system 
calls  are  used.  The  advantages  are:  simple  implementation,  simple  administration,  and  no 
large  operations  overhead.  Disadvantages  are:  slow  swapping  input/output  operations  and 
conflicts  with  the  file  system.  (When  the  file  system  is  swapping,  it  is  not  free  to  do  useful 
user  program  input/output.) 

The  file  system  manages  the  swap  device  as  it  does  other  regular  files.  The  only  change  to 
the  file  system  allows  MM  to  go  to  the  swap  directory  directly.  The  inode  of  the  swap 
directory  is  determined  and  saved  by  the  first  access  to  the  swap  directory.  Subsequent 
accesses  use  the  saved  information. 
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CHAPTER  4 


4.  IMPLEMENTATION 

4.1  Introduction 

This  description  of  the  MTNIX  swapping  implementation  shows  how  the  design  elements 
presented  in  Chapter  3  are  implemented.  It  assumes  a  knowledge  of  the  MINIX  version  1.3 
operating  system  and  the  design  described  in  Chapter  3.  All  swapping  code  is  written  in 
the  C  programming  language  just  as  most  of  MINIX.  The  code  itself  is  commented,  so  the 
descriptions  here  will  be  brief.  The  description  is  organized  into  3  parts:  kernel,  MM,  and 
FS.  The  code  is  listed  in  the  appendix3.  New  programs  and  functions  are  listed  in  their 
entirety.  All  C  functions  that  have  been  altered  are  listed  in  their  entirety.  Only  changes  to 
header  (.h)  files  are  listed. 

4  J  Kernel  Code 
4.2.1   clock.c 

4.2.1.1  do_ctocktick  Lines  64  through  69  update  the  kernel  process  table  residence  times 
for  each  active  process  by  a  simple  increment.  Residence  times  are  reset  to  0  when  a 
process  is  created,  moved  to  the  swap  device,  and  swapped  into  main  memory.    The 


3.    The  original  portions  of  the  code,  copyrighted  by  A.  S.  Tanenbaum,  are  reprinted  with  his  permission  with 
the  restriction  that  reproduction  be  limited. 
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residence  time  variable  is  an  unsigned  integer  with  a  maximum  value  of  65535.  The  value 
will  turnover  to  0  after  about  18  hours;  this  is  not  considered  to  be  a  problem. 

When  there  is  a  runnable  process  on  the  swap  device,  lines  70  through  75  send  a 
COREISFREE  message  to  the  swap  task  as  a  notification  that  process  residence  times 
have  changed.  The  swap  task  might  now  be  able  to  choose  a  process  for  swap  out. 

4.2.2  procc 

4.2.2.1  minirec  Lines  53  and  54  were  changed  to  check  for  a  pending  message  from  the 
swap  task  to  MM.  See  discussion  about  non-blocking  message  transfer  in  "Kernel  Code, 
swapper.c".  If  MM  is  waiting  for  a  message,  and  a  message  from  the  swap  task  is  ready, 
then  the  inform()  function  is  called  to  deliver  the  message. 

4.2.3  swapper.c 

4.2.3.1  swaptask  Upon  entry,  the  initswapO  function  is  called  to  set  the  initial  state  for 
the  swap  task.  Then,  as  with  other  tasks,  a  large  case  statement  is  entered  to  handle  all 
messages  that  can  be  received. 

COREISFREE  (line  88)  corresponds  to  the  detailed  design  description.  The 
trytoswinO  function  checks  for  runnable  processes  on  the  swap  device,  picks  the  most 
eligible,  and  sends  a  swap  in  request  to  MM. 

COREISNEEDED  (line  94)  first  determines  whether  the  message  indicates  that  a 
swapped  process  has  become  unblocked.   If  so,  the  kernel  process  table  slot  is  marked  as 
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not  BLOCKED.  If  swapstat  is  SWAPIDLE,  then  a  swap  in  is  attempted. 

SWAPJDMCOMPL  (line  105)  indicates  that  the  requested  swap  in  has  successfully 
completed.  The  process  table  is  updated  and  the  process  is  made  ready  to  run.  The  swap 
device  is  checked  for  more  runnable  processes. 

SWAPINF  AILED  (line  118)  indicates  that  the  requested  swap  in  has  failed.  MM  has 
sent  the  size  of  core  needed  and  a  list  of  all  processes  blocked  on  the  pause()  and  wait() 
system  calls.  MM  is  blocked  awaiting  a  reply  to  this  message,  the  pik_outsw()  function  is 
called  to  choose  a  process  for  swap  out  and  a  message  is  sent  to  MM  to  request  the  swap 
out.  If  no  eligible  process  is  chosen  for  swap  out,  the  swap  task  responds  to  MM  with 
NOSWAPOUT. 

SWAPOUTCOMPL  (line  139)  indicates  that  the  requested  swap  out  has  completed.  The 
NO_MAP  flag  is  set  and  the  BLOCKED  flag  is  maintained  by  the  kernel  to  indicate 
whether  a  swapped  process  is  blocked  or  runnable  (not  blocked). 

SWAPOUTFAILED  (line  152)  indicates  that  the  requested  swap  out  has  failed.  No 
action  is  performed  until  another  message  is  received. 

4.2.3.2  trytoswin  This  function  checks  for  runnable  processes  on  the  swap  device 
(process  table  flags  of  SWAPPED  and  BLOCKED),  chooses  the  most  eligible,  and  requests 
MM  to  perform  the  swap  in. 
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4.2.3.3  pikinsw  This  function  returns  the  process  slot  number  of  the  oldest,  runnable 
process  on  the  swap  device. 

4.2.3.4  pik  outsw  This  function  returns  the  process  slot  number  of  the  most  eligible 
process  to  swap  out.  It  does  this  by  assigning  a  value  to  each  process  based  on  a  number 
of  process  attributes.  An  initial  priority  value  of  0  is  assigned  to  each  process.  If  the 
process  is  at  least  as  large  as  the  size  needed,  then  P1CKSIZE  (10000)  is  added  to  the 
priority  value.  If  it  is  blocked  on  the  pauseO  or  waitO  system  call  and  has  been  resident  for 
the  minimal  interval  CRESMIN  (15  seconds),  then  PICKPWBLK  (100)  is  assigned.  If  it  is 
not  blocked  at  all  and  has  been  resident  for  CRESMIN,  then  PICKNBLK  (1)  is  added.  If  it 
is  ineligible  for  swap  out,  PICKINELL  (-20000)  is  added.  The  process  with  the  highest 
priority  value  above  0  is  returned. 

4.2.3.5  setswapproc  A  task  is  a  high  priority  process  and  MM  is  a  medium  priority 
process.  A  process  should  send  a  message  directly  to  a  lower  priority  process  only  when  it 
is  known  that  the  lower  priority  process  is  waiting  for  a  message.  It  is  not  always  known 
what  MM  is  doing,  so  a  non-blocking  message  transfer  has  been  devised. 

To  send  a  non-blocking  message  to  MM,  such  as  for  a  SWAPINREQUEST,  a  special 
kernel  global  structure  was  created  called  swap_proc.  It  contains  a  status  which  can  be 
either  MPENDING  (a  message  is  waiting  to  be  sent  to  MM)  or  NOMES,  and  a  message 
pointer.  This  function  builds  the  message,  sets  the  status  to  MPENDING,  and  sets  the 
message  pointer.   The  informQ  function  is  called.   If  MM  is  waiting,  the  message  will  be 
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delivered  immediately.  If  not,  then  it  is  delayed,  but  the  swap  task  is  not  blocked. 
Whenever  MM  requests  a  RECEIVE  for  ANY,  the  minirecO  function  (in  proc.c)  checks 
for  a  non-blocking  message  from  the  kernel  and  the  message  is  eventually  delivered. 

4.2.4   system.c 

4.2.4.1  dofork  Line  39  sets  residence  times  for  all  forked  processes  to  0,  whether 
swapped  or  not. 

Lines  42  through  50  are  performed  when  a  fork  swap  has  occurred.  The  process  table  is 
marked  and  if  the  swap  task  is  idle,  a  COREISNEEDED  message  is  sent.  The  PROC1 
message  variable  is  set  to  0  to  indicate  that  the  process  is  not  blocked. 

4.2.4.2  donewmap  When  an  exec  swap  occurs,  the  main  memory  space  of  the  execing 
process  is  used  as  a  data  buffer.  Data  is  copied  from  the  executable  file  to  this  main 
memory  area  and  then  written  to  the  swap  file.  (See  MM  exec.c  for  more  details.)  To  set 
up  this  memory  area  as  a  buffer,  exec  changes  the  memory  map  of  the  execing  process  by 
calling  the  sysnewmapO  function.  Lines  99  through  105  check  for  this  situation  and  set 
the  process  table  flag  to  NOMAP  to  indicate  that. 

4.2.4  J  doexec  If  an  exec  swap  has  occurred,  the  exswap  flag  is  set.  Lines  133  through 
140  set  the  process  table  to  SWAPPED  and  send  a  CORE  IS_NEEDED  message  to  the 
swap  task  if  it  is  SWAPIDLE. 

If  an  exec  has  occurred  without  a  swap  out  and  the  swap  task  status  is  SWAPDM,  lines  144 
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through  146  send  a  COREJSFREE  message  to  the  swap  task. 

4.2.4.4  doxit  When  a  process  dies,  main  memory  is  freed.  If  the  swap  task  status  is 
SWAPIN,  then  lines  212  through  215  send  a  COREIS  FREE  message  to  the  swap  task. 

4.2.4.5  dojock  This  is  a  new  function  that  allows  MM  to  turn  on/off  hardware  interrupts. 
It  is  used  when  MM  is  performing  memory  compaction. 

4.2.4.6  inform  If  a  non-blocking  message  from  the  swap  task  to  MM  is  queued,  then  lines 
254  through  259  will  send  the  message  immediately  (see  setswapproc). 

4J  MM  Code 
4.3.1    allocc 

4.3.1.1  allocmem  This  function  has  been  changed,  such  that,  if  a  request  for  contiguous 
memory  cannot  be  satisfied,  but  enough  fragmented  free  memory  exists,  then  the  entire 
memory  area  is  compacted  to  satisfy  the  request. 

4.3.1.2  tot  hole  This  is  a  new  function  that  calculates  the  sum  of  all  free  main  memory. 
It  replaces  the  maxholeO  function  which  returned  the  largest  hole  of  free  memory. 
Because  compaction  is  used,  the  total  free  memory  size  is  more  useful  than  just  the  largest 
hole  size. 
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4.3.1.3  compact  This  is  a  new  function  that  performs  the  memory  compaction.  It  begins 
by  calling  the  syslockO  function  to  rum  off  all  system  hardware  interrupts  so  that  no  I/O 
transfers  are  done  while  a  process  image  is  being  moved  to  another  memory  location.  A 
loop  is  entered  where  the  first  free  hole  is  found.  The  size  of  the  process  just  above  the 
hole  is  calculated  and  then  the  process  image  is  copied  into  the  hole.  The  process'  original 
memory  is  freed  and  core  is  allocated  for  the  process'  new  location.  The  process  table 
memory  map  is  adjusted.  The  loop  repeats  until  there  is  only  1  hole  remaining.  The 
hardware  interrupts  are  then  enabled. 

4.3.2   exec.c 

43.2.1  doexec  This  function  has  been  changed  to  add  the  exec  swap  out  feature.  The 
call  to  the  newmemO  function  on  line  63  returns  EXSWAPD  if  main  memory  is  not 
available  for  the  execed  program.  NewmemO  also  saves  a  copy  of  the  existing  memory 
map  for  this  process. 

If  a  swap  out  is  necessary,  lines  93  through  185  are  performed.  First  the  swap  file  is 
created  by  changing  to  the  swap  directory,  checking  access  permissions,  and  creating  the 
swap  file.  Any  failures  up  to  this  point  result  in  an  ERROR  return  value  which  indicates 
that  the  program  could  not  be  execed.  The  user  process  can  handle  that  error  any  way  that 
it  chooses.  Failures  beyond  this  point  result  in  system  errors  because  the  execing  process  is 
destroyed. 

To  put  the  execed  program  onto  the  swap  device,  the  entire  execing  process  memory  area 
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(up  to  2047  blocks)  is  converted  to  a  temporary  data  buffer.  The  sys  newmapO  function  is 
called  to  notify  the  kernel  of  the  change.  The  third  parameter,  TRUE,  indicates  that  this 
will  be  a  data  buffer  and  that  the  system  task  should  mark  the  kernel  process  table  with  the 
NOMAP  flag  to  prevent  this  process  from  being  chosen  to  run. 

The  executable  program  is  then  copied  to  the  swap  device  within  the  loop  beginning  at  line 
136.  The  text  segment  is  copied  into  the  data  buffer  of  the  execing  process  via  the 
loadsegO  function  and  then  written  to  the  swap  file  via  the  special  version  of  the  write() 
system  call  in  which  FS  copies  data  directly  from  the  user  process  memory  area  to  the  disk 
file  without  going  through  MM.  After  the  segment  has  been  completely  copied,  the  file  size 
is  increased  so  that  it  is  equal  to  the  length  of  the  segment  as  indicated  in  the  MM  process 
table  memory  map.  This  makes  swap  in  much  easier.  The  loop  is  repeated  for  the  data 
segment. 

The  stack  is  created  from  the  original  stack  and  written  to  the  swap  file  in  the  same  manner 
as  above. 

The  memory  area  is  freed  and  the  MM  process  table  is  marked  as  SWAPPED.  The  call  to 
the  sysexecO  function  tells  the  kernel  (swap  task)  about  the  exec  and  possible  swap  out 
(see  Kernel  Code  -  system.c  -  doexec). 

The  fork  swap  out  and  forced  swap  out  both  call  the  swapoutO  function  in  mswap.c.  They 
copy  process  images  from  main  memory  to  a  swap  file.  The  exec  swap  does  not  call 
swapoutO  because  it  must  copy  the  program  from  the  executable  file,  transform  the  data 
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into  a  process  image,  and  then  write  it  to  the  swap  file.  In  addition,  the  amount  of  memory 
used  by  the  execing  process  is  used  as  a  buffer  for  the  file  transfer  and  transformation  and 
might  not  be  large  enough  for  the  entire  new  program.  So  it  must  be  done  piecemeal. 
Rather  than  add  this  complexity  to  the  swapoutO  function,  it  was  included  here. 

4.3.2.2  new_mem  This  function  has  been  changed,  lines  268  through  281,  to  check  for 
total  free  memory  rather  than  just  largest  contiguous  free  memory  hole.  It  also  includes  the 
size  of  the  execing  process  image  in  the  free  memory  calculation.  If  memory  is  not 
available,  newmemO  generates  the  memory  map  for  the  new  program  in  a  temporary 
variable  and  returns  EXSWAPD  to  indicate  that  an  exec  swap  out  should  be  done. 

The  size  of  the  execing  process  image  is  also  calculated.  If  a  swap  out  is  necessary,  the 
size  will  be  used  to  determine  how  large  the  data  buffer  can  be. 

43.2.3  load  seg  The  new  input  parameter  "usr"  Qine  340)  allows  the  calling  function  to 
specify  the  user  process  whose  memory  area  is  being  loaded.  The  previous  version 
defaulted  to  the  execing  program.  The  new  parameter  is  useful  for  the  swap_inO  function 
in  mswap.c. 

4.3.3  forkexit.c 

433.1  dofork  This  function  has  been  changed  to  add  the  fork  swap  out  feature  and  to 
use  the  memory  compaction  feature  to  reduce  the  number  of  fork  swap  outs.  The  check  for 
free  memory,  lines  38  through  45,  looks  at  total  free  memory  rather  than  the  largest 
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contiguous  free  memory  area.  If  memory  is  not  available,  then  a  fork  swap  out  must  be 
done.  The  memory  map  is  adjusted,  if  necessary.  The  user  program  controls  the  program 
stack  pointer  and  can  increase  the  stack  size  beyond  the  stack  region.  MINIX  is  not  aware 
of  it  until  the  program  requests  more  data  space.  When  that  occurs,  MINIX  checks  the 
stack  pointer  and  adjusts  the  stack  region  size  in  the  memory  map  to  correspond  to  the  new 
stack  pointer.  The  adjustO  function  verifies  and  adjusts,  if  necessary,  that  the  stack  pointer 
is  within  the  stack  region  as  defined  in  the  memory  map.  The  memory  map  is  used  as  a 
measure  of  the  process  image  size  for  swap  out  and  swap  in. 

Lines  77  through  81  call  the  swapout()  function  to  copy  the  parent's  image  to  the  swap  file. 

Lines  86  and  87  set  the  new  MM  process  table  variable  to  0. 

Line  109  notifies  the  kernel  (via  the  system  task)  as  to  the  status  of  the  fork.  If  a  fork  swap 
has  occurred,  the  system  task  is  not  notified  of  the  childs  new  map  nor  is  the  reply  returned 
to  the  child  (because  it  is  swapped  out).   It  will  be  returned  after  the  child  is  swapped  in. 

4332  mmexit  This  function  has  been  changed,  lines  164  through  173,  to  check  for 
waiting  parents  that  are  swapped  out.  If  so,  then  the  cleanup  of  the  process  table  slot  and 
return  of  the  exit  status  to  the  parent  is  delayed  until  the  parent  is  swapped  in.  The  memory 
occupied  by  the  exiting  process  is  freed  in  either  case. 

4333  dowait  This  function  has  been  changed,  lines  225  through  234,  to  check  for 
processes  on  the  swap  device.   If  so,  then  a  CORE_IS_FREE  message  is  sent  to  the  swap 
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task  to  indicate  that  the  process  that  has  just  called  wait()  is  eligible  for  swap  out. 

4.3.4  main.c 

43.4.1  main  In  the  call  to  the  service  functions  on  line  27,  a  dummy  parameter  (0)  was 
added.  This  allows  the  service  functions,  specifically  doswoutO,  to  be  called  with 
parameters  by  other  functions  (see  "MM  Code,  doswin"). 

4.3.4.2  reply  Line  52  was  added  to  exclude  the  swap  task  from  the  validation  check 
because  MM  does  send  replies  directly  to  the  swap  task  and  the  swap  task  is  not  in  the  MM 
process  table. 

4.3.5  mswap.c  This  new  file  contains  procedures  for  swap  out  and  swap  in. 

4.3.5.1  doswout  This  function  is  called  from  mainO  when  the  SWAPOUTREQUEST 
is  received  from  the  swap  task.  It  is  also  called  from  do_swin()  after  a  swap  in  fails  and 
the  swap  task  suggests  a  process  for  swap  out.  It  starts  by  getting  the  number  of  and 
pointer  to  the  slot  of  the  process  to  be  swapped  out.  The  swapoutO  function  is  called  to 
perform  the  swap  out.  If  the  swap  out  succeeds,  the  process  slot  is  marked  as  SWAPPED, 
and  if  the  process  is  blocked  on  the  pauseO  or  waitO  system  call,  the  PROC2  message 
variable  is  set  to  TRUE.  The  SWAPOUTCOMPLETE  message  is  sent  to  the  swap  task. 
If  the  swap  out  failed,  the  SWAP_OUT_FAILED  message  is  returned. 
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4.3.5.2  swapout  This  function  is  called  from  the  do_swoutO  function  for  a  forced  swap 
out  and  from  the  doforkO  function  for  a  fork  swap  out  The  big  difference  between  the 
two  is  that  the  fork  swap  does  not  want  to  remove  the  parent  process  image  from  main 
memory,  but  the  forced  swap  out  does.  The  clearmem  variable  is  FALSE  for  the  former 
and  TRUE  for  the  latter.  The  dump_core()  function  is  called  to  perform  the  actual  swapout. 
If  it  fails,  an  error  message  is  returned  to  the  caller.  If  it  succeeds,  and  if  it  is  a  forced 
swap  out,  then  the  memory  of  the  process  is  freed. 

4.3.5.3  doswin  This  function  is  called  from  main()  when  the  swap  task  sends  a 
SWAPJNREQUEST  message.  do_swin()  gets  the  process  table  slot  number  of  the 
process  to  swap  in,  calculates  the  amount  of  memory  needed,  and  tries  to  allocate  it.  If 
memory  is  available,  then  do_swin()  marks  the  process  as  not  SWAPPED  and  calls  the 
swap_in()  function  to  perform  the  swap  in.  After  swap  in  completion,  many  items  are 
checked  and  some  cleanup  chores  are  performed. 

If  the  process  was  fork  swapped,  then  a  reply  of  0  is  sent  to  indicate  that  it  is  a  child  of  the 
fork. 

If  any  signals  were  sent  to  the  process  while  it  was  swapped  out,  then  they  must  be  applied 
to  the  process.  (The  actual  signal  handling  will  be  done  when  the  process  is  put  into  the 
RUNNING  state.)  During  swap  out,  all  received  signals  are  stored  in  the  mpsswmap 
process  table  variable.  For  each  signal  that  was  received,  the  sig_proc()  function  is  called 
to  process  it. 
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If  a  process  was  blocked  on  the  wait()  system  call  and  one  of  its  child  processes  died,  then 
the  zombie  child  can  now  be  laid  to  rest,  its  process  table  slot  freed,  and  its  exit  status 
returned  to  its  parent.  All  done  by  the  cleanupO  function.  The  slot  number  of  the 
HANGING  child  is  saved  in  the  mpdeadchild  process  table  variable. 

If  a  process  was  blocked  by  a  pauseO  or  waitO  system  call  and  a  signal(s)  was  sent  to  the 
process  to  wake  it  up,  then  the  EINTR  (i.e.,  system  call  interrupted)  error  value  must  be 
returned  to  the  process. 

The  SWAPINCOMPLETE  message  is  returned  to  the  swap  task. 

If  memory  is  not  available  for  the  swap  in,  then  this  function  gathers  knowledge  known 
only  by  MM  and  sends  it  to  the  swap  task.  With  this  knowledge,  the  swap  task  can  make  a 
more  informed  choice  of  which  process  to  select  for  swap  out.  It  gathers  the  slot  numbers 
of  all  processes  in  core  that  are  blocked  on  the  pause()  and  waitO  system  calls  and  stuffs 
them  bitwise  into  a  32-bits  variable  (LONG1).  It  also  calculates  the  amount  of  main 
memory  needed  for  the  swap  in  (PROC1). 

It  is  necessary  to  freeze  the  user  processes  at  this  point  so  that  they  cannot  progress  and 
change  their  pause/wait  status.  So  the  SWAPJN  FAIL  message  is  sent  to  the  swap  task 
using  the  sendrecO  function.  MM  blocks  on  RECEIVE  from  the  swap  task.  If  the  swap 
task  cannot  choose  a  process  for  swap  out,  then  no  action  is  taken.  If  the  swap  task 
chooses  a  process  for  swap  out,  the  doswoutQ  function  is  called  to  perform  the  task. 
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4.3.5.4  swapin  This  function  is  called  by  the  doswinf)  function.  It  generates  the  swap 
file  name  from  the  process  slot  number  and  opens  the  file.  The  memory  map  is  changed  to 
reflect  the  new  base  address  in  main  memory.  The  gap  location  is  initialized  to  zeros  for 
security.  Each  segment,  text,  data,  and  stack  is  copied  from  the  swap  file  via  the  loadsegO 
function.  At  completion,  the  swap  file  is  removed. 

4.3.6   signalx 

4.3.6.1  checksig  Lines  56  through  61  were  added  to  handle  swapped  out  processes  to 
which  a  signal  was  sent.  The  sig_procO  function  is  delayed  until  the  process  is  swapped  in. 
The  fact  that  a  signal  was  sent  is  recorded  bitwise  in  the  process  table  variable 
mpsswmap.  Note  that  the  unpause()  function  is  called  just  after  this. 

4.3.6.2  unpause  Lines  117  through  121  check  for  swapped  processes  that  are  also  paused. 
If  so,  the  process  table  is  marked  as  WASPWS  (was  pause/wait  and  a  signal  was  received). 

The  process  is  now  runnable,  so  a  COREIS  NEEDED  message  is  sent  to  the  swap  task 
with  the  process  table  slot  number  of  the  awakened  process.  The  swap  task  will  mark  the 
process  table  as  not  BLOCKED  and  attempt  to  swap  in  the  process.  Lines  129  through  133 
accomplish  the  same  thing  for  waiting  processes. 

43.6.3  do_pause  Lines  85  through  92  send  a  CORE_IS_FREE  message  to  the  swap  task 
to  indicate  that  a  process  has  blocked  itself  and  is  available  for  swap  out  if  necessary. 
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43.6.4  dumpcore  This  function,  as  its  name  implies,  was  used  only  for  writing  the 
process  image  to  a  file  on  disk.  It  has  been  altered  to  serve  another  purpose:  swap  a 
process  image  out  to  the  swap  device.  It  just  so  happens  that  these  two  functions  are 
similar  enough  to  handle  them  in  a  general  manner.  Changes  were  made  throughout  the 
function  so  it  will  be  described  in  its  entirety. 

The  input  parameter  "type"  is  0  for  a  dump  core  request,  1  for  a  swap  out,  and  2  for  a  fork 
swap  out.  The  function  changes  to  the  working  directory  of  the  user  process  for  dump  core 
or  the  swap  device  for  swap  outs.  The  file  is  checked  for  write  access  permission.  For 
swap  out,  the  swap  file  name  is  generated  from  the  process  table  slot  number.  The  file  is 
created;  if  it  fails,  an  error  is  returned. 

The  memory  map  is  checked  for  accuracy  (i.e.,  is  the  stack  pointer  within  the  stack  region?) 
and  adjusted,  if  necessary. 

For  core  dumps,  the  memory  map  is  written  to  the  core  file.  Each  process  segment,  text, 
data,  and  stack  is  written  to  the  core  file  or  swap  file  (as  appropriate).  The  writeO  system 
call  to  FS  is  special  in  that  the  data  is  copied  directly  from  the  user  memory  area  and 
written  to  disk  without  going  through  the  MM. 

4.4  FSCode 
4.4.1   main.c 
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4.4.1.1  fsinit  A  new  variable,  swapnode  is  declared  and  initialized  to  NIL  on  lines  10 
and  29.  It  is  used  to  store  the  inode  of  the  swap  directory. 

4.4.2   stadir.c 

4.4.2.1  dochdir  Lines  38  through  49  were  added  to  allow  MM  to  quickly  change  to  the 
swap  directory  when  the  cd_flag  is  2.  If  the  directory  has  not  been  opened  yet,  then  the 
changeO  function  is  called  to  get  the  inode  and  store  it  in  the  swapnode  variable. 

4.4.2.2  change  Lines  80  through  83  explicitly  set  the  user_path  to  the  name_ptr  if  the 
cd  flag  is  2. 
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CHAPTER  S 

5.  CONCLUSIONS 
5.1   Results 

The  implementation  of  swapping  satisfies  the  requirements  of  Chapter  2: 

•  It  successfully  extends  the  limited  memory  of  the  system.  The  CPU  is  more  fully 
utilized  and  user  frustration  is  reduced. 

•  All  existing  functions  are  maintained;  no  new  functionality  was  added. 

•  The  MTN1X  structure  is  maintained;  related  extensions  are  suggested  below. 

•  Installation  is  simple.  It  requires  installing  a  protected  directory  "Aisr/swap". 
Maintenance  requires  ensuring  that  enough  secondary  memory  is  available  for  the  swap 
device. 

•  Performance  is  acceptable.  It  is  not  degraded  when  swapping  is  not  in  use  and  shows 
acceptable  degradation  when  in  light  use.  Table  5-1  shows  benchmarks  of  system 
performance  under  various  conditions.  The  benchmarks  are  based  on  compiles  of  the 
MTNTX  kernel,  MM,  and  FS  using  the  MTNIX  supplied  compiler  and  makefiles.  Note 
that  there  are  no  perceptible  compile  time  differences  between  MINIX  version  1.3 
without  swapping  and  version  1.3  with  swapping  when  no  swapping  occurs.  When 
swapping  occurs,  the  performance  starts  to  suffer.  The  AT  class  machine  takes  about 
23%  more  time  to  concurrently  compile  the  kernel  and  MM  while  the  XT  class  only 
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requires  about  11%  more  time.  This  is  considered  light  to  medium  swapping  and 
because  the  compiles  are  CPU  intensive,  it  is  acceptable.  For  heavier  swapping,  the  AT 
class  machine  takes  about  52%  more  time  to  concurrently  compile  the  kernel,  MM,  and 
FS.  This  is  probably  exceeding  the  capacity  of  the  CPU  and  is  a  hardware  problem. 

The  size  of  the  MIN1X  process  memory  image  increased  by  10%  (about  12K  bytes). 
(Part  of  the  increase,  about  13%,  is  a  result  of  increasing  the  number  of  process  table 
slots  from  16  to  24  to  allow  more  processes  to  be  active.) 

Response  time  for  interactive  processes  suffers  when  CPU  bound  processes  are  running.  It 
becomes  more  prevalent  with  swapping  because  more  processes  can  be  run  concurrently. 
The  response  degradation  occurs  because  MINIX  assigns  the  same  priority  to  all  user 
processes.  A  new  process  scheduler  algorithm  is  required  to  remedy  this. 

5.2  Improvements  and  Further  Development 

The  implementation  provides  an  opportunity  to  devise,  experiment,  and  test  various 
scheduling  and  swapping  algorithms.  In  addition,  the  implementation  can  be  extended  in 
other  ways: 

•  User  process  priorities  can  be  established  to  assign  interactive  processes  a  higher  user 
priority  to  provide  immediate  response. 

•  Better  swap  out/in   algorithms   can   be   implemented,   possibly   integrating   process 
scheduling  with  process  swapping. 
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■  Shared  text  can  be  implemented  to  make  swapping  more  efficient. 

■  A  real  brk()  system  call  can  now  be  implemented  in  MINIX. 

The  exec  swap  out  can  be  optimized  to  eliminate  a  read  and  write  of  the  incoming 
program. 


A  more  efficient  swap  device  can  be  implemented. 


OS 

COMPILE  TASK 

HARDWARE 

AT 

XT 

MINIX 
1.3  v 

kernel 

7:43 

FS 

5:43 

MM 

3:22 

TOTAL 

16:48 

MINIX 
1.3v 

w/swapping 
not  used 

kernel 

7:45 

49:09 

FS 

5:53 

MM 

3:24 

20:02 

TOTAL 

17:02 

69:11 

MINIX 
1.3  v 

w/swapping 
in  use 

kernel  &  MM 

13:44 

75:42 

kernel,  MM,  &  FS 

25:38 

TABLE  5-1.  MINIX  NON-SWAPPING  vs  SWAPPING  BENCHMARKS 
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1 
2 

>  3 

>  4 

>  5 

>  6 

>  7 

>  8 
9 
10 

11 

12 

>  13 

>  14 

>  15 

>  16 

17 
18 
19 
20 

>  21 

>  22 

>  23 

>  24 
25 
26 
27 
28 

>  29 

>  30 

>  31 

>  32 


coniLh 

/*  swapjask  defines  •/ 
♦define  SWAP  IDLE  1 
♦define  SWAPJN 
♦define  SWAP_OUT  3 
♦define  NOMES 
♦define  MPENDING  5 


glo.h 


/*  nothing  to  swap  in  or  out  •/ 

2  /•  runnable  process  on  swap  device  •/ 

/*  swapjask  has  ordered  a  swap  out  */ 

4  I*  No  message  is  wailing  for  MM  */ 

I*  Message  is  waiting  for  MM  •/ 


/*  trace  display  •/ 

EXTERN  int  Mag; 

EXTERN  int  swap_stat; 

EXTERN  struct  sw_mm_mes  swapjjroc 


/•  if  =  0,  no  display,  else  auto  display  */ 


proc.h 

unsigned  rcs_time; 
♦define  SWAPPED 
♦define  BLOCKED 
♦define  STICKY 


/•  residence  time  in  seconds  (core,  swap)  V 
040  /*  process  is  on  swap  device  */ 

0100  /•  swapped  proc  is  blocked  */ 

0200  /•  proc  has  sticky  bit  set  */ 


type.h 

PUBLIC  struct  sw_mm  mes  { 
int  status; 
message  *ms; 

h 
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20 


26 
27 

28 


35 


39 


4  PRIVATE  dodocktickO 

5  { 

6  /*  This  routine  ii  called  on  every  clock  tick.  */ 
7 

8  register  struct  proc  *jp; 

9  register  int  t,  procjir, 

10  extern  int  pr_busy,  pcount,  cum_count,  prev  d; 
11 

12  /*  To  guard  against  race  conditions,  first  copy  'lostjicks'  to  a  local 

13  *  variable,  add  this  to  'realtime',  and  then  subtract  it  from  'lost  ticks'. 

14  */ 

15  i  =  lost_ricks;  /*  Tostticks'  counts  missed  interrupts  */ 

16  realtime +=  I  +  1;  /*  update  the  time  of  day  */ 

17  lostticks  -=  t;  /*  these  interrupts  are  no  longer  missed  */ 
18 

19  if  (next_alarm  <=  realtime)  { 

I*  An  alarm  may  have  gone  off,  but  proc  may  have  exited,  so  check.  */ 

21  nextjdarm  =  MAXPLONG;    /*  sun  computing  next  alarm  */ 

22  for  (rp  =  &proc[0];  rp  <  &proc[MR_TASKS+NR  PROCS] ;  rp++)  { 

23  if  (rp->p_alarm  !=  (real_time)  0)  { 

24  /•  See  if  this  alarm  time  has  been  reached.  */ 

25  if  (rp->p_alarm  <=  realtime)  { 

/*  A  timer  has  gone  off.    If  it  is  a  user  proc, 

*  send  it  a  signal.   If  it  is  a  task,  call  the 

*  function  previously  specified  by  the  task. 

»  */ 

3"  procjir  =  rp  •  proc  -  NRTASKS; 

31  if  (procjir  >=  0) 

32  cause  «ig(proc  nr,  SIGALRM); 

33  else 

34  (*walch_dog[-proc_nr]X); 

rp->p_alarm  =  f>, 

36  j 

37 

38  /*  Work  on  determining  which  alarm  is  next.  */ 

if  (rp->p_alarm  1=  0  &&  rp->p_alarm  <  nexialarm) 

40  nexl_alarm  =  rp->p_alann; 

41  } 

42  } 

43  ) 
44 

45  accountingO;  I*  keep  track  of  who  il  using  the  cpu  •/ 

46 

47  /•  If  input  characters  are  accumulating  on  an  RS232  line,  process  them.  */ 

48  if  (fhishjlag)  { 

49  l  =  (int)  realtime;  /*  only  low-order  bits  matter  •/ 

50  if  (  (t  4  FLUSH_MASK)  =  0)  nJuahO;  /»  Bush  By  input  •/ 

51  } 
52 

53  /*  If  a  user  process  has  been  running  too  long,  pick  another  one.  */ 
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54  if  (-schedticks  =  0)  { 

55  if  (bill_ptr  =  prevjir)  sched();  I*  process  has  ran  too  long  */ 

56  schedticks  =  SCHEDRATE;  /*  react  quantum  */ 

57  prev_ptr  =  biDjW,  /*  new  previous  process  •/ 
58 

59  /*  Check  if  printer  is  hung  up,  and  if  to,  icitait  iL  */ 

60  if  (pr_busy  AA  pcount  >  0  &&  cum_couni  =  previa)  pr_char0; 

61  prev_ct  =  cum_count;/*  record  #  characters  printed  so  far  */ 

62  } 
63 

>  64  f*  if  residence  update  time,  then  update  all  residence  times  '/ 

>  65  if  (resd_ucks  <=  realtime)  { 

>  66  resd  ticks  =  realtime  +  RES_RATE; 

>  67  for  (rp  =  proc_addr(LOWJJSER);  rp  <  &proc[NR_TASKS+NR_PROCS];  rp++) 

>  68  if  (rp->p_flags  1=  P_SLOT_FREE) 

>  69  rp->resjime++; 

>  70  if(swap_stat  =  SWAPJN)  { 

>  71  /*  notify  swap  task  that  residence  times  have  changed  */ 

>  72  mess.m  source  =  CLOCK; 

>  73  mess.m_type  =  CORE_IS_FREE; 

>  74  send(SWAP  TASK,  Amess); 

>  75  } 

76  if(secofids++  >=  5)  { 

77  seconds  =  0; 

78  if(Dflag) 

79  a_dmp0; 

80  } 

81  } 


82  ) 
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10 


17 


36 


4  PRIVATE  int  mini_rec(caller,  lie,  m_ptr) 

5  int  caller,  y*  process  trying  lo  get  message  •/ 

6  int  sic;  f  which  message  source  is  wanted  (or  ANY)  •/ 

7  message  •m_ptr;  /•  pointer  to  message  buffer  •/ 

8  { 

9  /*  A  process  or  task  wants  to  get  a  message.  If  one  is  already  queued, 
*  acquire  it  and  deblock  the  sender.  If  no  message  from  the  desired  source 

11  *  is  available,  block  the  caller.  No  need  lo  check  parameters  for  validity. 

12  *  Users  calls  arc  always  scndrecO,  and  minisendO  has  checked  already. 

13  *  Calls  from  the  tasks,  MM,  and  FS  are  trusted. 

14  •/ 
IS 
16  extern  struct  sw_mm_mes  swap_proc; 

register  struct  proc  *caller_ptr,  *sender_ptr,  *prcvious_plr, 

15  int  sender; 
19 

20  caller_pu  =  proc_addr(caller);   /*  pointer  to  caller's  proc  struct  */ 
21 

22  I*  Check  lo  see  if  a  message  from  desired  source  is  already  available.  */ 

23  sender_ptr  =  caller jMr->p  callerq; 

24  if  ((caUer_ptr->p_flags  &  SENDING)  =  0)  { 

25  while  (sender_ptr  1=  NIL_PROC)  { 

26  sender  =  sender_ptr  -  proc  -  NR_TASKS; 

27  if  (sre  =  ANY  1 1  src  =  sender)  { 

28  /"An  acceptable  message  has  been  found.  */ 

29  cp_mess(sender,  sender_ptr->p_map[D].mem_phys, 

30  iender_Dtr->p_messbuf,  caller jrtr->p_map[DJ.mem_priys1  m_ptr); 

31  sender_ptr->p_flags  &=  SENDING;  /*  deblock  sender  */ 

32  if  (sender_ptr->p_flags  =  0)  ready{sender_ptr); 

33  if  (senderjxr  =  caller_ptr->p_carierq) 

34  callerj>tr->p_cal]erq  =  sender_ptr->p_sendlink; 

35  else 
previous_ptr->p_sendlink  =  sender_ptr->p  sendlink; 

37  retum(OK); 

38  } 

39  previous_ptr  =  senderjxr; 

40  senderjHr  =  sender_ptr->p_sendlink; 

41  } 

42  } 
43 

44  f*  No  suitable  message  is  available.  Block  the  process  trying  to  receive.  */ 

45  caUer_ptr->p_getfrom  =  src; 

46  caller_ptr->p_messbuf  =  m_ptr, 

47  if  (caller_ptr->p_fiags  =  0)  unready(caller_ptr); 

48  callerjxr->p  flags  |=  RECEIVING; 
49 

50  /*  If  MM  has  just  blocked  and  there  are  kernel  signals  pending,  now  is  the 

51  •  time  to  tell  MM  about  them,  since  it  will  be  able  to  accept  the  message. 

52  •/ 

•  53  if(  ((sig_procs  >  0)  1 1  (swap_proc.status  =  MPENDING))  &.&. 
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>  54  (caller  =  MM_PROC_NR  lines  ANY)  )  { 

55  infoimO; 

56  pick_procO; 

57  } 

58  retum(OK); 

59  } 


Appendix  A-7  -  MODIFICATIONS  TO  KERNEL  CODE 


Jul  6  15:13  1989  SWAPPER.C  Page  1 


>  1 

>  2 

>  3 

>  4 

>  5 

>  6 

>  7 

>  8 
»  9 

>  10 

>  11 

>  12 

>  13 

>  14 

>  15 

>  16 

>  17 

>  18 

>  19 

>  20 

>  21 

>  22 

>  23 

>  24 

>  25 

>  26 

>  27 

>  28 

>  29 

>  30 

>  31 

>  32 

>  33 

>  34 

>  35 

>  36 

>  37 

>  38 

>  39 

>  40 

>  41 

>  42 

>  43 

>  44 

>  45 

>  46 

>  47 

>  48 

>  49 

>  50 

>  51 

>  52 

>  53 


/*  This  file  cesiums  the  code  and  data  for  the  swapper  task.  It 
has  a  single  entry  point,  swapjask()  It  accepts  six  message 
types: 

CORE_IS_FREE:      memory  has  been  released  or  can  possibly  be  freed 
CORE_IS_NEEDED:  turntable  process  is  on  swap  device 
SWAP_IN_COMPL:     swapjask  request  has  succeeded 
SWAP  INFAILED:   swapjask  request  has  failed 
SWAP_OUT  COMPL:   swapjask  or  MM  request  has  succeeded 
SWAPOUTFAILED:  swapjask  request  has  failed 

The  input  message  is  format  ml.  The  pa  nun  tiers  are: 

mjype  PROC1       PROC2        PLO      MEM_PTR    UTnjTY 


|  CORE  ISFREE      | 

I- 


I 


I 


I 


COREJS_NEEDED    |unblocked| 


I- 

|  SWAP  IN  COMPL     Iswappedinl  | 

I  SWAP_OUT_COMPL    Iswapedoutl  blocked  | 
| h f j. ,. ,. 

|  SWAP_OUT_FAILED  |  proc  no  |  | 


The  input  message  is  formal  mZ  The  parameters  are: 


I  SWAPINFAILED    |sizeofcor|pausewait| 


I 


I 

^include 

•include 
#include 

•  include, 
•include 

#  in  chide 
•indude 
•indude 
#in  elude 
#indude 


./h/consLh" 
./h/type.h" 
./h/callnr.h" 
./h/com.h" 
./h/error.h" 

"../h/signal.h" 

"consLh" 

"lype.h" 

"gloh" 

"proch" 


/*  constant  definirions  */ 
•define  NOP  1 

#define  SWJNP        2 
•define  SW  OUTP     3 


/*  nothing  in  progress  */ 
I*  swap-in  in  progress  */ 
/*  swap-out  in  progress  */ 


I*  swap  out  algotithm  definitions  "I 

•define  PICKSEE      10000        f  size  is  >=  size  needed  */ 

•define  PICKSTICK    1000        /•  sticky  tat  is  not  set  •/ 

•define  PICKPWBLK  100         /•  proc  is  PAUSE/WAIT  •/ 

•define  P1CKSRBLK      10  /*  proc  is  blocked  on  both  S  &  R  •/ 
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>  54         »denne  PICKNBLK         1  /»  proc  ii  nol  blocked  U  all  •/ 

>  55         #define  PICK1NELL        -20000  /»  proc  is  inelligible  for  iw»p  oul  •/ 

>  56 

>  57         #define  CRESMIN  15  /»  min  residence  time  in  seconds  •/ 

>  58         #define  LONG1  m2_Il        /•  muiige  dot  lo  carry  long  •/ 

>  59 

>  60         I*  swapper  usk  variables  */ 

>  61  PRIVATE  int  inprogres; 

>  62         PRIVATE  message  mes,  irons; 

>  63 

>  64 

>  65       r 

>  66  *  swapjask  ' 

>  67  • 

>  68         PUBLIC  swap  taskO 

>  69         { 

>  70  /*  Main  program  of  swap  usk.   It  determines  which  of  the  6  possible 

>  71  *  calls  this  is  by  looking  at  'mes.m_type'.    Then  it  dispatches. 

>  72  •/ 

>  73  struct  proc  *rp; 

>  74  int  opcode,  pnum; 

>  75  long  bm. 

>  76  phys_clicks  sizeneed; 

>  77  phys_bytes  src_phys,  dst_phys; 

>  78  vir_byles  ptr. 

>  79 

>  80  mit_swap0;  /»  initialize  swap  tables  ■/ 

>  81 

>  82  /*  Main  loop  of  the  swap  usk.  Get  work,  process  it.  sometimes  reply.  •/ 

>  83  while  (TRUE)  { 

>  84  reccive(ANY.  Ames);  /*  go  get  a  message  */ 

>  85  opcode  =  mes.m  type;  /*  extract  the  message  type  V 

>  86 

>  87  switch  (opcode)  { 

>  88  case  COREISFREE: 

>  89  if(swap_sut  =  SWAP_IN)  &&  (inprogres  =  NOP)  { 

>  90  try_to  swinfj; 

>  91  } 

>  92  break; 

>  93 

>  94  case COREJSNEEDED: 

>  95  if(mes.PROCl  1=  0)  ( 

>  96  I*  PROC1  on  swap  device  just  became  unblocked  •/ 

>  97  rp  =  proc_addr(mes.PROCl); 

>  98  rp>p_nags&=  BLOCKED; 

>  99  } 

>  100  if(swap_iut  =  SWAPJDLE)  { 

>  101  try_to_swin0; 
>102                               ) 

>  103  break; 
>104 

>  105  case  SWAPJN  COMPL: 

>  106  if(inprogres  =  SWINP)  ( 
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>  107  inprogrcs  =  NOP; 

>  108  pnum  =  mes.PROCl; 

>  109  rp  =  proc_addr(pnum); 

>  110  ip->p_B.gs  &=  SWAPPED; 

>  1 1 1  ip->res_time  =  0; 

>  112  if  (rp->p_flags  =  0)  ready(rp); 

>  113  Uy_to_swin0; 

>  114  }d« 

>  1 15  printff  S_I_C:  swapin  NOT  in  progress!)); 

>  116  break; 

>  117 

>  118  c.sc  SWAP_JN_FAILED: 

>  119  ifdnprogres  =  SWINP)  { 

>  120  inprogrcs  ■  NOP; 

>  121  sizeneed  =  (phys  dicks)mes.PROCl; 

>  122  bm  •  mes.LONGl; 

>  123  mms.msource  =  SWAPTASK; 

>  124  if(  (pnum  =  pik_oulsw(bm,  sizeneed))  !=  0)  { 

>  125  ip  =  proc  addr(pnum); 
>126  ip->p_nag!  INSWAPPED; 

>  127  unready(rp); 

>  128  inprogres  =  SWOUTP. 

>  129  mms.mtype  -  SWAP_OUT_REQ; 

>  130  mms.PROCl  =  pnum; 

>  131  }  else  { 

>  132  mms.m  type  =  NOSWAP  OUT; 

>  133  ) 

>  134  iend(MM_PROC_NR,  &mms); 

>  135  }  else 

>  136  printfC'SJ  F:  swap  in  NOT  in  ptogressO); 

>  137  break; 

>  138 

>  139  case  SWAP_OUT_COMPL: 

>  140  if(inprogres  =~SWOUTP)  { 

>  141  inprogrcs  =  NOP, 

>  142  ip  m  proe_addr(mes.PROCl); 

>  143  ip.>p  flags  |=  NO_MAP; 

>  144  if(mes.PROC2) 

>  145  ip->P_flags  |=  BIjOCKED; 

>  146  tp->res_lime  =  0; 

>  147  try  to_swin0; 

>  148  }  else 

>  149  piintfr S  O  O  swap  out  NOT  in  progressO); 

>  150  break; 

>  151 

>  152  cue  SWAP  OUT_FAJLED: 

>  153  if(inprogres  =  SWOUTP)  ( 

>  154  ip  =  prec_addr(mes.PROCl); 

>  155  ip->p_flags  4=  SWAPPED; 

>  156  inprogrcs  =  NOP; 

>  157  if  (rp->p_flags  =  0)  ready(rp); 

>  158  }  else 

>  159  printffS_0__F:  swap  out  NOT  in  progressO); 
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>  160 

>  161 


>  162  default:  panicfiwap  task  got  had  message",  mes.m_type); 

>  163  break; 

>  164  } 

>  165  ) 

>  166         } 
>167 

>  168        init_swapO 
>169        { 

>  170  inprogres  =  NOP: 

>  171  swap  proc. slatus  =  NOMES; 

>  172  swapstat  =  SWAP_IDLE; 

>  173         ) 
>174 

>  175         /*  set  up  non-blocking  message  transfer  to  MM  */ 

>  176        set_swapproc(rype,  num) 

>  177         int  type,  num; 

>  178         { 

>  179  inmsjnsouree  =  SWAP  TASK; 

>  180  mms.m_type  =  type; 

>  181  mms.PROCl  =  tmm; 

>  182  swap_proc.ms  =  &mms; 

>  183  swap_proc.sutus  =  MPENDING; 

>  184  if(  ((proc[NR_TASKS  +  MM_PROC_NR].p  flags  &  RECEIVING)  =  0)  |  | 

>  185  (proc[NR_TASKS  +  MM_PROC_NR].p_getfrom  1=  ANY)  ) 

>  186  return; 

>  187  informO; 

>  188         } 

>  189 

>  190        lry_to_swinO 

>  191         { 

>  192        ml  proc; 

>  193  if(  (prcc  •  pikinswfj)  =  0)  { 

>  194  swapstat  =  SWAPIDLE; 

>  195  }  else  { 

>  196  inprogres  =  SWJNP; 

>  197  swapstat  =  SWAPIN; 

>  198  set  swapproc(SWAP_IN  REQ,  proc); 
>199                        } 

>200        } 
>201 

>  202        pikjnswO 
>203         ( 

>  204         /*  returns  the  proc  slot  #  of  lite  most  elligible  proc  on  swap  device  */ 

>  205        /*  to  swap  in.     If  none  are  elligible,  then  0  is  returned  •/ 

>  206         struct  proc  *rp; 

>  207         struct  proc  •slkrp; 

>  208         struct  proc  *noslkrp; 

>  209         unsigned  stkres,  noslkres; 

>  210        int  pick; 

>  211 

>  212  slkrp  =  proc_addr(0); 
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>  213  nostkrp  =  proc  addr(0); 

>  214  jikres  =  0; 

>  215  nostkres  =  0; 
>216 

>  217  for(rp  =  proc_iddi<INIT_PROC_NR  +  1);  rp  <  proc  «ddr(NR_PROCS);  ip++)  { 

>  21 8  if(rp->p_fl.gs  4  PSLOTFREE)  continue; 
>219  if(  (rp->pjlags  A  SWAPPED)  && 

>  220  ((ip->p_fUgs  &  BLOCKED)  =  0)  )  ( 

>  221  if(ip->p_fl«gs  &  STICKY)  ( 

>  222  if(ipores_lime  >=  stkres)  { 

>  223  Bkro  =  ,p; 

>  224  stkres  =  rp->res_time; 
>225                                            } 

>  226  }  else  if(rp->res_time  >=  nostkres)  { 

>  227  nostkrp  =  ip; 

>  228  nostkres  =  ip->res_time; 
>229                                       } 

>230  } 

>23I  } 

>  232  iKsUcip  1=  proc_addr(0)) 

>  233  pick  =  (inlXstkrp  -  proc  addi<0»; 

>  234  else 

>  235  pick  =  (inlXnoslkip  -  proc_addr(0)); 

>  236  ietum(pick); 
>237        ) 

>238 
>239 

>  240        pik_oulsw(map,  sizeneed) 

>  241         long  map; 

>  242        int  sizeneed; 
>243         { 

>  244         /*  returns  the  proc  slot  #  of  the  most  elligible  proc  in  core  */ 

>  245         /*  to  swap  out.     If  none  are  elligible.  then  0  is  returned  */ 
>246 

>  247         struct  proc  *rp,  *htp; 

>  248         int  priority,  hpriority; 

>  249         unsigned  hres; 
>250 

>  251  hpriority  -  0; 

>  252  hres  =  0; 

>  253  tap  =  proc_«idr(0); 

>  254  for(ip  =  proc_addi(INTr_PROC_NR  -tl);  rp  <  proc_addr(NR_PROCS);  rp++)  ( 

>  255  if(rp->p_flags  &  (P_SLOT_FREE  |  SWAPPED))  continue; 

>  256  priority  =  0; 

>  257  if(  (rp->p_map{S].mem_phys  + 

>  258  rp->p_map[S]jnem_Ien  • 

>  259  rp->p_maprT].mem_phys)  >=  sizeneed) 

>  260  priority  +=  PICKS1ZE; 

>  261  if(  (ip->p_flags  &  STICKY)  =  0) 

>  262  priority  «  PICKSTICK; 

>  263  priority  +=  blocktype(rp,  map); 

>  264  if(  (priority  >  hpriority)  |  | 

>  265  (priority  =  hpriority  &&  rp->res_time  >  hres)  )  { 
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>  266  hpriority  =  priority, 

>  267  hies  =  rp->res_lime; 

>  268  hip  =  ip; 
>269                          ) 

>270  } 

>  271  if(bpriority  =  0) 

>  272  ietum(0); 

>  273  ietum(  (intXhip  -  proc_iddi<0))); 
>274         } 

>275 
>276 

>  277        /*  determine  if  process  is  blocked  or  pause/wail,  SEND  &  RECEIVE.  •/ 

>  278         f  or  not  at  .11  •/ 

>  279         blocktype(ip,  map) 

>  280         struct  proc  *ip; 

>  281  long  map; 
>282        { 

>283 

>284  tf(  (map  >>  (ip  -  procaddrtO))  &  \)  SlSl 

>  285  (ro->res  time  >=  CRESMTN)  ) 

>  286  ietum(PICKPWBLK); 

>  287  if(  (ip->pjlags  &  (SENDING  |  RECEIVING)  =  0)  && 

>  288  (ip->iea_time  >=  CRESMTN)  ) 

>  289  ieium(PICKNBLK); 

>  290  if(  (ip->p_flags  &  (SENDING  |  RECEIVING)  =  (SENDING  |  RECEIVING))  && 

>  291  (ip->rcs_ume  >=  CRESMTN)  ) 

>  292  nsuim(PICKINELL); 

>  293  tetum(PICKTNELL); 
>294         ) 
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2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 

>  39 
40 
41 

>  42 

>  43 

>  44 

>  45 

>  46 

>  47 

>  48 

>  49 

>  50 
51 
52 
53 


PRIVATE  inl  dofoik(mjjtr) 
message  *m_pu; 

{ 


t*  pointer  10  request  message  */ 


/•  Handle  sysforkfj-   tV  has  forked.  The  child  is  'k2\  */ 


register  struci  proc  arpc; 

register  char  *sptr,  *dptr, 

mlkl; 

mill; 

int  piil; 

int  bytes; 

int  fkswap; 


I*  pointen  for  copying  proc  struct  •/ 

/*  number  of  parent  process  */ 

/*  number  of  child  process  */ 

/*  process  id  of  child  */ 

/*  counter  for  copying  proc  struct  */ 

/*  TRUE,  if  fork  has  swapped  out  child  */ 


kl  =  mjHr->PROCl;  I*  extract  parent  slot  number  from  msg  */ 

k2  =  m_ptr->PROC2;  /•  extract  child  slot  number  */ 

pid  =  mjHi->PID;  /*  extract  child  process  id  */ 

fkswap  =  (ini)mj)tr->UTILITY;  /*  extract  swap  status  of  child  •/ 

if  (kl  <  0  1 1  kl  >=  NR_PROCS  |  |  k2  <  0  1 1  k2  >=  NR_PROCS)retum(E_BAD_PROC); 
rpc  =  proc_addr(k2); 

/*  Copy  parent  'proc'  struct  to  child.  */ 

sptr=  (char*)proc_addr(kl);  f*  parent  pointer  */ 

dpir  =  (char  *)  proc_addr<k2);  f  child  pointer  */ 

bytes  =  sizeof(struct  proc);  f*  it  bytes  lo  copy  */ 

while  (bytes-)  *dptr++  =  •sptr++;  /*  copy  parent  struct  to  child  */ 

rpc->p  flags  |  =  NO_MAP;  I*  inhibit  the  process  from  running  */ 

ipc->p_flags  &.=  PENDING;  /*  only  one  in  group  should  have  PENDING  */ 
rpc->p_pending  =  0; 

rpcopjsd  =  pid;  /*  install  child'*  pid  •/ 

rpc->p_Teg[RET_REG]  =  0;  /*  child  sees  pid  =  0  to  know  it  is  child  */ 


rpc->user_time  =  0; 
rpc->sys_time  =  0, 
rpc->res_lime  =  0, 
rpc->child  mime  =  0; 
rpc->child_stime  =  0; 
if(fkswap)  { 
rpc->p_flags  |  =  SWAPPED; 
if(swap_sut  =  SWAPTDLE)  { 
messjnsource  =  SYSTASK; 
mess.m_type  =  CORE_IS_NEEDED; 
mess.PROCl  =  0; 
send(SWAP_TASK,  Amess); 
} 
} 

retum(OK); 
} 


/*  set  all  the  accounting  limes  lo  0  */ 
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54 
55 


56  *  do_newmap 


58  PRIVATE  im  do_newmap(ro_ptr) 

59  message  *m_ptr;  /*  pointer  to  request  message  */ 

60  { 

61  f*  Handle  sys_newmapO-  Fetch  the  memory  map  from  MM.  */ 
62 

63  reginer  struct  proc  "rp,  *rsrc; 

64  phys_bytes  src_phys,  dstjphys,  pn; 

65  vir_bytes  vmm,  vsys,  vn; 

66  int  caller;  /*  whose  space  has  the  new  map  (usually  MM)  */ 

67  int  lq  /*  process  whose  map  is  to  be  loaded  */ 

68  int  o!d_flags;  f*  value  of  flags  before  modification  */ 

69  struct  mem_map  *map_ptr.       /*  virtual  address  of  map  inside  caller  (MM)  */ 

70  int  mil;  f  TRUE,  if  exec  uses  core  as  a  buffer  */ 
71 

72  f*  Extract  message  parameters  and  copy  new  memory  map  from  MM.  */ 

73  caller  =  m_ptr->m_source; 

74  k  =  m_ptr->PROCl; 

75  map_ptr  =  (struct  memjnap  •)  m_ptr->MEM_PTR; 

76  util  =  (mi)m_j*r->UTILJTY;     /*  extract  swap  status  of  child  */ 

77  if  (k  <  -NR_TASKS  1 1  k  >=  NR_PROCS)  retum(E_BAD_PROC); 

78  ip  =  proc_addr(k);  /*  ptr  to  entry  of  user  getting  new  map  */ 

79  nrc  =  proc_addr(caller);  /*  ptr  to  MM's  proc  entry  */ 

80  vn  =  NRSEGS  *  si2eof(struci  memmap); 

81  pn  =  vn; 

82  vmm  =  (vir_bytes)  map_ptr;      /*  careful  about  sign  extension  */ 

83  vsys  =  (vir_bytes)  rp->p_map;  I*  again,  careful  about  sign  extension  */ 

84  if  (  (src_phys  =  umap(rsrc,  D,  vmm,  vn))  =  0) 

85  panicCbad  call  to  sysnewmap  (sre)",  NO_NUM); 

86  if  (  (dst_phys  =  umap(proc_add"r(SYSTASK),  D,  vsys,  vn))  =  0) 

87  panicCbad  call  to  sysnewmap  (dst)",  NONUM); 

88  phys_copy(src_phys,  dst_phys,  pn); 
89 

90  «ifdef  i8088 

91  /*  On  8088,  set  segment  registers.  */ 

92  rp->p_reg[CS_REG]  =  rpop_maprT].mem_phys;       I*  *a  cs  */ 

93  rp->p_reg[DS_REGJ  =  rp->p_map[D].mem_phys;       f  set  ds  */ 

94  ip->p_reg[SS_REG]  =  ip->p_map[D].mem_phys;       /*  set  ss  •/ 

95  ip->p_regIES_REGl  =  rp->p_map[D].mem_phys;       J*  set  es  */ 

96  #endif 
97 

98  /*  don't  make  process  ready  if  core  is  being  used  as  an  I/O  buffer  V 

>  99  if(!uril){ 

>  100  old_flags  =  ipopflags;       f*  tuve  the  previous  value  of  the  flags  •/ 

>  101  rp->p_ftags  &=   NO  MAP; 

>  102  if  (oW_ftags  1=  0  &&  ip->p_flags  =  0)  ready(rp); 

>  103  }  else  { 

>  104  rp->p_flags  |=  NO  MAP; 

>  105  } 

106  retum(OK); 
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107 
108 
109 
110 


111  •  do_exec 

112  ' 


113  PRIVATE  inl  do_exee(mjlr) 

114  message  *m_plr,  /*  pointer  to  request  message  */ 

115  { 

116  I*  Handle  sys_exec().  A  process  has  done  a  successful  EXEC.  Patch  it  up.  */ 
117 

118  register  struct  proc  *rp; 

119  int  k;  /•  which  process  */ 

120  int  *sp;  f*  new  sp  */ 

121  int  exswap;  /*  TRUE,  if  exec  has  swapped  out  new  proc  */ 
122 

123  It  =  m_ptr->PROCl;  /»  V  tells  which  process  did  EXEC  */ 

124  sp  =  (int  *)  m_ptr->STACK_PTR; 

125  exswap  =  (int)m_ptr->UTIlJTY;  /*  extract  swap  sums  of  new  proc  */ 

126  if  (k  <  0  |  |  k  >=  NRPROCS)  return(E_BADPROC); 

127  rp  =  proc  adcirfk); 

128  rp->p_sp  =  sp;  I*  set  the  stack  pointer  •/ 

129  rp->p_pcpsw.pc  =  (inl  (•)())  0;  /*  reset  pc  */ 

130  rp->p_a]arm  =  0,  I*  reset  alarm  timer  */ 

13 1  rp->p"flags  &=  RECEIVING;  /»  MM  does  not  reply  to  EXEC  call  •/ 
132 

>  133  if(exswap)  { 

>  134  rp->p_flags  |=  SWAPPED; 

>  135  if(swap_stat  =  SWAPIDLE)  ( 

>  136  messjn_source  =  SYSTASK; 

>  137  mess.mjype  •  COREISNEEDED; 

>  138  messPROCl  =  0; 

>  139  send(SWAP  TASK,  &mess); 
>140                            } 

>  141  }  else  if(swap_sut  =  SWAPIN)  { 

>  142  /*  notify  swapper  that  core  has  been  freed  */ 

>  143  mess.m_source  =  SYSTASK; 

>  144  rness.mjype  =  CORE  IS_FREE; 

>  145  send(SWAP_TASK.  ftiness); 
>146           } 

147 

148  if  (rp-:>p_flags  =  0)  ready(rp); 

149  if(exswap) 

150  set_name(k,  (char  *)0);  f*  erase  command  suing  from  Fl  display  •/ 

151  else 

152  setjiamefk,  (char  *)sp);  /*  uve  command  string  for  Fl  display  "/ 
153 

154  retumfOK); 

155  ) 
156 

157 

158  - 

159  •  do  xil 
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160  * 

161  PRIVATE  int  do_xit(m_ptr) 

162  message  •m_plr,  /*  pointer  to  request  message  */ 

163  { 

164  /*  Handle  sys_xil()-   A  process  has  exited.  */ 
165 

166  register  struct  proc  *rp,  *rc; 

167  struct  proc  *np,  *xp; 

168  mt  parent;  /•  number  of  exiting  proc's  parent  •/ 

169  int  proc_nr,  /*  number  of  process  doing  the  exit  */ 
170 

171  r»rent  =  m_ptr->PROCl;  /*  iloi  number  of  parent  process  */ 

172  pn>c_nr  =  m_ptr->PROC2;        /*  slot  number  of  exiting  process  •/ 

173  if  (parent  <  0  1 1  parent  >=  NR_PROCS  |  |  proc_nr  <  0  |  |  procnr  >=  NR_PROCS) 

174  retum(E_BAD_PROQ; 

175  rp  =  proc_addr(parent); 

176  re  =  proc_addr(proc_nr); 

177  rp->child_utime  +=  rc->userjime  +  rc->child_utime;  /*  accum  child  times  */ 

178  rp->child_stime  +=  rc->sys_bme  +  re->child  slime; 

179  rc->p_aUrm  =  0;  f*  rum  off  alarm  timer  •/ 

180  if  (rc->p_flags  =  0)  unready(rc); 

181  iet_name(proc_nr.  (char  *)  0);  I*  disable  command  printing  for  Fl  */ 
182 

183  /*  If  the  process  being  terminated  happens  to  be  queued  trying  to  send  a 

184  *  message  (i.e.,  the  process  was  killed  by  a  signal,  rather  than  it  doing  an 

185  *  EXIT),  then  ii  must  be  removed  from  the  message  queues. 

186  ♦/ 

187  if  (rc-spflags  &  SENDING)  { 

188  f+  Check  all  proc  slots  to  see  if  the  exiting  process  is  queued.  */ 

189  for  (rp  =  AprccIOJ;  rp  <  &proc[NR_TASKS  4  NR_PROCS];  rp++)  { 


tf  (rp->p_callerq  =  NIL_PROC)  continue; 


190 

191  if  (ip->p_calleiq  =  re)  { 

'92  l*  Exiting  process  is  on  front  of  this  queue.  *J 

193  rp->p_callerq  =  re->p_sendlink; 

194  break; 

195  }  else  { 

196  /*  See  if  exiling  process  is  in  middle  of  queue.  */ 

197  np  =  rp->p  callerq; 

198  while  (  (  xp  =  np->p_sendlink)  1=  NIL_PROC) 

199  if  (xp  =  re)  { 

200  np->p_sendlink  =  xp->p_sendlink; 

201  break; 

202  }c]se{ 

203  np  =  xp; 

204  } 

205  } 

206  } 

207  } 

208  if  (rc->p  flags  &  PENDING)  ~sig_procs; 

209  re->p_flags  =  P_SLOT_FREE; 
210 

>  21 1  /*  notify  swapper  thai  core  has  been  freed  */ 

>  212  if(swap_stat  =  SWAPJN)  { 
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>  213  mess.m_type  =  CORE_IS_FREE; 

>  214  •end(SWAP_TASK,  Amess); 
>215          } 

216 

217  rctum(OK); 

218  } 
219 

220 

221  " 

222  • 

223  - 


>  224         PRIVATE  inl  do_lock(m  _ptr) 

>  225         message  *m_j>tr;  /•  pointer  to  request  message  •/ 
>226         ( 

>  227         /*  Handle  sys_lock  or  restore  request  from  MM  *t 
>228 

>  229  if(m_ptr->LOCK_RES  =  LOCK)  ( 

>  230  m_ptr->PSW  =  (int)  lodcO; 

>  231  )  else  { 

>  232  restore(  (unsigned)  m_ptr->PSW); 
>233           ) 

>  234  letum(OK); 
>235         ) 

>236 
>237 
238        .r 


239  •  inform 

240  » 


241  PUBLIC  informO 

242  { 

243  /*  When  a  signal  is  delected  by  the  kernel  (eg.,  DEL),  or  generated  by  a  task 

244  *  (e.g.  clock  task  for  SIGALRM).  causesigO  is  called  to  set  a  bit  in  the 

245  *  p_pending  field  of  the  process  to  signal.  Then  informO  is  called  to  see 

246  *  if  MM  is  idle  and  can  be  told  about  it.  Whenever  MM  blocks,  a  check  is 

247  *  made  to  sec  if  'sig_procs'  is  nonzero;  if  so,  informO  is  called. 

248  */ 
249 

250  register  struct  proc  *rp; 
251 

252  /*  MM  is  waiting  for  new  input.  Find  a  process  with  pending  signals.  */ 

253  /*  does  swapper  want  to  send  message  to  MM  now?  •/ 

>  254  if(swap_proc.sulus  =  MPENDING)  { 

>  255  if  (mini_send(SWAP_TASK,  MM_PROC_NR,  swapj>roc.ms)  !=  OK) 

>  256  panicCcan't  inform  MM",  NOJTOM); 

>  257  swap_proc.sutus  =  NOMES; 

>  258  return: 
>259  } 

260  for  (rp  =  proc_addr(0);  rp  <  proc_addr(NR_PROCS);  rp++) 

261  if  (rp->p_flags  *  PENDING)  ( 

262  m.m  Jype  ■  KSIG; 

263  m.PROCl  >  rp  -  proc  •  NRTASKS; 

264  m.SIG_MAP  =  rp->p_pending; 

265  sig_procs— ; 


Appendix  A- 18  -  MODIFICATIONS  TO  KERNEL  CODE 


Jul  6  15:13  1989  SYSTEM.C  Ptgrs6 


266  if  (miru_iend(HARDWARE.  MMPROCNR,  Am)  !=  OK) 

267  pmicfcBl'l  inform  MM",  NO_NUM); 

268  rp->p_pending  =  0;     /*  the  ball  it  now  in  MM't  court  */ 

269  rp->p_nig!  4=  PENDING; 

270  if  (rp->p_rligi  =  0)  ready(rp); 

271  return; 

272  } 

273  1 
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1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 

>  38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 

>  49 
50 
51 
52 

>  53 


/*  The  object  file  of  "tablex"  contains  all  the  data    In  the  Ml  files, 
declared  variables  appear  with  EXTERN  in  front  of  them,  as  in 

EXTERN  int  x; 

Normally  EXTERN  is  defined  as  extern,  so  when  they  are  included  in  another 
file,  no  storage  is  allocated.  If  the  EXTERN  were  not  present,  but  just 


then  including  this  file  in  several  source  files  would  cause  V  to  be 
declared  several  times.  While  some  linkers  accept  this,  others  do  not, 
so  they  are  declared  extern  when  included  normally.  However,  it  must 
be  declared  for  real  somewhere.  That  is  done  here,  by  redefining 
EXTERN  as  the  null  string,  so  the  inclusion  of  all  the  *.h  files  in 
table.c  actually  generates  storage  for  them.   All  the  initialized 
variables  are  also  declared  here,  since 

extern  int  x  =  4; 

is  not  allowed.   If  such  variables  are  shared,  they  must  also  be  declared 
in  one  of  the  *Ji  files  without  the  initialization. 
7 


#  in  elude 
•include 
•include 
#include 
•include 
•undef 
•define 

*  include 
•include 
•include 


"../h/consLh" 

"../h/type.h" 

"../h/com.h" 

"consLh" 

"typc.h" 

EXTERN 

EXTERN 

"glo.h" 

"proch" 

"tty.h" 


extern  int  sysjasko,  clock  JaskfJ,  memJaskO,  floppy _task0, 

winchesterJaskO,  ttyjaskO,  printer_usk(),  swapjaskO; 

#ifdef  AMKERNEL 
extern  int  amoeba_lask(); 
extern  int  amint_task0; 
•endif 

/*  The  startup  routine  of  each  task  is  given  below,  from  -NR_TASKS  upwards. 

*  The  order  of  the  names  here  MUST  agree  with  the  numerical  values  assigned  to 

•  the  tasks  in  .Jh/com.h. 
•/ 

•define      SMALL_STACK 

•define      TTY  STACK 
•define  SWAP  STACK 


512 

SMALL_STACK 
SMALL  STACK 
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54 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
67 
68 
69 
70 
71 
72 

>  73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
(7 
88 
89 
90 
91 
92 
93 
94 
95 

>  96 
97 
98 
99 

100 
101 
102 
103 
104 
105 
106 


•define  PRINTERSTACK 
•define      WTNCH_STACK 
FLOP_STACK 
MEM_STACK 
CLOCK  STACK 
SYS  STACK 


•define 
•define 
•define 
•define 


SMALL_STACK 
SMALLSTACK 
SMALL~STACK 
SMALL "STACK 
SMALLSTACK 
SMALL  STACK 


•ifdef  AM  KERNEL 


# 

# 

•else 

* 

•endif 


define 
define 
define 


AMINT_STACK 
AMOEBASTACK 
AMOEBA  STACK  SPACE 


SMALL_STACK 

1532 

(AM_NTASKS'AMOEBA_STACK  +  AMTNT_STACK) 


AMOEBA  STACK  SPACE       0 


•define      TOT  STACK  SPACE 


(TTY_STACK  +  AMOEBASTACKSPACE  < 
SWAP  STACK  *  \ 
PRINTERSTACK  +  \ 
WINCHSTACK  +  FLOP  STACK  +  \ 
MEM_STACK  *  CLOCK~STACK  *  SYSSTACK) 


•*  some  notes  about  the  following  table: 

*•  1)  The  uy  task  should  always  be  fim  so  that  other  tasks  can  use  printf 

••        if  their  initialisation  has  problems. 

**  2)  If  you  add  a  new  kernel  task,  add  it  after  the  amoeba_lasks  and  before 

•*       the  printer  task. 

*•  3)  The  task  name  is  used  for  process  status  (Fl  key)  and  must  be  six  (6) 

•*       characters  in  length.  Pad  it  with  blanks  if  it  is  too  short. 

♦/ 


PUBLIC  struct  tasktab  tasklabl] 
ttyjask, 

•ifdef  AM_KERNEL 
amintjask, 
amoebatask, 
amoeba_task, 
amoeba  task, 
amoeba  task. 


TTYSTACK, 


"TTY 


•endif 


AMTjNT_STACK,      "AMTNT ", 
AMOEBASTACK,  "AMTASK", 
AMOEBASTACK,  "AMTASK", 
AMOEBASTACK,  -AMTASK", 
AMOEBASTACK,  "AMTASK", 

■wapjask,  SWAP_STACK,         "SWAPER", 

printer  task,  PRINTER  STACK,   "PPJNTR". 

winchesterusk.         WINCHSTACK,       "WINCHE", 

floppy  task,  FLOP  STACK.         "FLOPPY". 

memjask,  MEM_STACK.  "      -RAMDSK", 

clock Jask.  CLOCK_STACK.      "CLOCK", 

ays  task.  SYSSTACK.  "SYS     ", 

0,  "  0,  "IDLE  ", 

0,  0,  "MM     ", 

0,  0,  "FS     ", 

0,  0.  "INIT  " 
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107         ); 

108 

109  mi  tsuckrTOTSTACKSPACEVsizeof  (inl)]; 

110 

111  inl  k_suckfK_STACK_BYTES/sizeof  (int)];  f  The  kernel  suck.  •/ 

112 

113 

114  /• 

115  ••  The  number  of  kernel  usks  must  be  the  lime  as  NR_TASKS. 

116  **  If  NR_TASKS  is  not  conect  then  you  will  gel  the  compile  error 

117  •*    multiple  case  entry  for  vilue  0 

118  •*  The  function dummy  is  never  called. 

119  •/ 
120 

121  *dcnne  NKT  (sizeof  tasktab  /  sizeof  (struct  lasktab)  -  (TNITPROCNR  +  1)) 

122         dummyO 

123  ( 

124  switdi(0) 

125  { 

126  case  0: 

127  case  (NR_TASKS  =  NKT): 

128  ; 

129  } 

130  ) 


APPENDIX  B  -  MODIFICATIONS  TO  MEMORY  MANAGER  CODE 


Appendix  B-2  -  MODIFICATIONS  TO  MEMORY  MANAGER  CODE 


Jul  6  14:30  1989  MM.H  Page  1 


I  couth 
2 

3  »dcfine  SWAPMODE  0777  /•  mode  to  use  on  swap  device  files  */ 

4 

5 

6  mproc.h 

7 

8  unsigned  mp_isw_map;  /•  bitmap  of  signals  recvd  while  swapped  •/ 

9  inl  mpdeadchild;"  /*  >0  means  WATTING  SWAPPED  proc's  child  died*/ 
10  *define  SWAPPED  0100  /•  process  is  swapped  out  •/ 

II  #define  FKSWAPPED  0200  /*  process  is  swapped  oul  by  fodc  •/ 

12  #define  WASPWS  0400  /•  process  was  P/W  4  SWAPPED  A  then  awakened  •/ 
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1  l> 

2  *  alloc  mem 

3  • 


4  PUBUC  physdicks  aUocjnem(clicks) 

5  phys_clicks  clicks;  /•  amount  of  memory  requested  */ 

6  { 

7  I*  Allocate  a  block  of  memory  from  the  free  lilt  using  first  fit.  The  block 

8  *  consists  of  a  sequence  of  contiguous  bytes,  whose  length  in  clicks  is 

9  •  given  by  'clicks'.  A  pointer  to  the  block  is  relumed.  The  block  is 

10  *  always  on  a  click  boundary.  This  procedure  is  called  when  memory  is 

11  •  needed  for  FORK  or  EXEC. 

12  •/ 
13 

14  register  struct  hole  *hp,  *prev_ptr; 

15  phys_clicks  old  base; 
16 

>  17  while(TRUE)  { 

>  18  hp  =  hole_head; 

>  19  while  (hp  b  NH.HOLE)  { 

>  20  if  (hp->h_len  >=  clicks)  { 

>  21  /*  We  found  a  hole  that  is  big  enough.   Use  it.  */ 

>  22  old_base  =  hp->h_base;  /•  remember  where  it  started  */ 

>  23  hp->h_base  +=  clicks;  /•  bite  a  piece  off  */ 

>  24  hp->h  len  -=  clicks;/*  ditto  */ 

>  25 

>  26  /*  If  hole  is  only  partly  used,  reduce  size  and  return.  •/ 

>  27  if  (hp->hjen  1=  0)  retum(old  J»se); 

>  28 

>  29  /*  The  entire  hole  has  been  used  up.   Manipulate  free  list.  •/ 

>  30  del_slot(prev_ptr,  hp); 

>  31  retum(oldbase); 

>  32  ) 

>  33 

>  34  prev_ptr  =  hp; 

>  35  hp  =  hp->h_next; 

>  36  } 

>  37  if(tot_hole0  <  clicks) 

>  38  "      break; 

>  39  compactO;  /*  mem  is  available,  compact  to  get  it  */ 

>  40  ) 

>  41  retttm(NO_MEM); 

>  42  } 
43 


45  i- 

16  *  tothole 

47  ' 


>  48  PUBLIC  physdicks  totholeO 

>  49  ( 

>  50  I*  Scan  the  hole  list  and  return  sum  of  all  holes.  V 

>  51 

>  52  register  struct  hole  *hp; 

>  53  register  phys_c!icks  total; 
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>  54 

>  55  fap  •  holehead; 

>  56  total  =  0. 

>  57  tlo( 

>  58  total  +=  hp->hjen; 

>  59  )while  ((hp  =  hp->h_ne:i"l)  !=  NIL_HOLE); 

>  60  retum(total); 

>  61  } 

>  62 


63 
M 
65 
66 


{ 

I*  Go  through  the  memory  hole  map  and  eliminate  all  holes  except  one 
*  by  actually  moving  process  images  into  the  empty  spaces. 
•/ 


67  PUBLIC  compactO 

>  68 

>  69 

>  70 

>  71 

>  72 

>  73  phys_clicks  holebase,  old  base,  proc_size; 

>  74  long  hb,  ob.  ps; 

>  75  struct  mproc  •rmp; 

>  76  unsigned  old_sute; 

>  77  int  found; 

>  78 

>  79  iys_lock(LOCK,  ioldstate); 

>  80  whilethole_head->h_next  !=  NIL  HOLE)  { 

>  81  /•  find  first hole*/ 

>  82  hole_base  =  ho!e_head->h  base; 

>  83 

>  84  I*  find  proc  just  above  this  hole  •/ 

>  85  found  =  0; 

>  86  for(nnp  ■  &mproc[INrTJ>ROC  NR  +1];  imp  <  improc|NR  PROCS1;  rmp++)  { 

>  87  if(  ((rmp.>mp_flags  &  INJJSE)  =  0)            1 1 

>  88  (rmp-Mnpflags  &  (SWAPPED  |  FKSWAPPED))  1 1 

>  89  (rmp-J-mpJlags  4  HANGING)  ) 

>  90  continue; 

>  91  if(  (mvpomp  seg[T]jnem_phys  -  bole_base)  =  hole_head->h  len)  { 

>  92  found  •  1; 

>  93  break; 

>  94  } 

>  95  } 

>  96  ifffound  =  0)  { 

>  97  return; 

>  98  } 

>  99  old_base  =  rmp->rnp_segrT].rnem_phys; 

>  100  proc_sizc  =  rmpomp_seg[S,.mein_phys  +  rmp->mp_segIS].mem  len  -  old  base; 

>  101 

>  102  hb  =  (long)  holebase  «  CLICK  SHIFT; 

>  103  ob  =  (long)  oldbase  «  CUCKJSHJFT; 

>  104  ps  =  (long)  procsize  «  CLICKSHDT; 

>  105 

>  106  if(mem_copy(ABS.O.ob,  ABS.OJib,  ps)  =  OK)  ( 
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>  107  f*  free  old  core*/ 

>  108  free_inem(old_base,  proc_size); 

>  109  /*  allocate  new  core  */ 

>  110  if(hole_base  1=  alloc_mem(proc_nze))  { 

>  1 1 1  «ys  _lock(RESTORE,  Aoldstaie); 

>  112  panicC'compact  -  «lloc  error",!); 

>  113  } 

>  114 

>  115  I*  setup  process  table  map  */ 

>  116  rmp->mp_seg[S].mem_phys  =  rmp->mp_seg[S].mem_phys  - 

>  117  rmp->mp_segrT].mem_phys  +  hole__b*se; 

>  118  rmp->mp_seg[D]jnemjjhys  =  rmp->mp_«egrT]jnern_Ien   +  ncde_base; 

>  119  rmp->mp_segrr]-nrem_phys  =  hole  base; 

>  120  /•tell  kernel  */ 

>  121  iys_newmap(  (int)  (imp  ■  mproc),  rmp->mp_seg,  FALSE); 
>122  } 

>  123 


>  124  } 

>  125  sys_lock(RESTORE,  Aold  slate); 

>  126         } 
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1  /» 

2  *  do  exec 


4  PUBLIC  int  do_exec() 

5  { 

6  f  Perfomj  the  execvefname,  argv,  envp)  call.  The  user  library  builds  a 

7  *  complete  stack  image,  including  pointers,  args,  environ,  etc.  The  suck 

8  *  is  copied  lo  a  buffer  inside  MM,  and  then  to  the  new  core  image. 

9  •/ 
10 

11  char  mbuf [MAX_ISTACK_BYTES];         /•  buffer  for  stack  and  zeroes  */ 

12  char  swap_name[4]; 

13  char  *new_sp,  'a,  *psrc,  *pdst; 

14  int  s,  r,  in_fd,  out  fd,  swap  fd,  ft,  sd; 

15  int  swapout  =  FALSE; 

16  unsigned  loadbytes; 

17  vir_bytes  sre,  dst,  lext_bytes,  daia_bytes,  bssjjytes,  stk_bytes,  vsp; 

18  phys_bytes  lot_bytes;  /*  total  space  for  program,  including  gap  */ 

19  physclicks  oldclk,  dbufjen; 

20  long  sym_bytes,  ttbytes,  xtrabytes; 

21  vir_clicks  sc; 

22  struct  mproc  *rmp,  *tmpmp; 

23  struct  stat  ijbof,  djxif; 

24  struct  memmap  tmm[NR_SEGS]; 

25  union  u  { 

26  char  nameJ»f[MAX_PATH];     /*  the  name  of  the  file  to  exec  */ 

27  char  2b[ZEROBUF_SIZE];  f*  used  to  zero  bss  */ 

28  }u; 
29 

30  /"Do  some  validity  checks.  */ 

3 1  imp  =  mp; 

32  stk_bytes  =  (virbyies)  stackbytes; 

33  if  (stk_bytes  >  MAX_ISTACK_BYTES)  retumfENOMEM);       /*  slack  too  big  */ 

34  if  (exec  fan  <=  0  1 1  exec  ta  >  MAX  PATH]  retum(EINVAL); 
35 

36  I*  Get  the  exec  file  name  and  see  if  the  file  is  executable.  */ 

37  sre  =  (vir_bytes)  exec_name; 

38  dst  =  (virbyies)  u.name_buf; 

39  r  =  mern_copy(who,  D,  (long)  sre,  MM_PROC_NR,  D,  (long)  dst,  (long)  execjen); 

40  if  (r  1=  OK)  retum(r);  /*  file  name  not  in  user  data  segment  */ 

41  teH_fs(CHDIR,  who,  0,  0);        /*  temporarily  switch  to  user's  directory  */ 

42  in_fd  =  allowed(u.name_buf,  &s_buf,  XBIT);  /*  is  file  executable?  */ 

43  tell_fs(CHDIR,  0,  1,0);  /*  switch  back  to  MM's  own  directory  */ 

44  if  (in  fd  <  0)  retum(in_fd);       f  file  was  not  executable  */ 
45 

46  f*  Read  the  file  header  and  extract  the  segment  sizes.  */ 

47  sc  =  (slk_bytes  +  CLICK J5EE  - 1)  »  CLICK_SHIFT; 

48  if  (read_header(in_fd.  Aft,  &texl_bytes,  AdaU_bytes,  Abts  bytes, 

49  &tot_bytes,  Asymbytes,  sc)  <  0)  { 

50  dose(in_fd);  /*  something  wrong  with  header  */ 

51  retum(ENOEXEC); 

52  } 


53 
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54  /*  Fetch  the  suck  from  the  user  before  destroying  the  old  core  image.  */ 

55  ire  =  (vir_bytes)  suck_ptr; 

56  dst  =  (vir_bytes)  mbuf; 

57  if  (mem_copy(who.  D,  (long)  ire,  MM_PROC_NR,  D,  Gong)  dst, 

58  (long)  «k_bytea)  1=  OK)  { 

59  cloie(m  fd);  /*  csai't  fetch  suck  (e.g.  bad  virtual  addr)  */ 

60  retum(£ACCES); 

61  } 
62 

63  f  Allocate  new  memory  and  release  old  memory.   Fix  map  and  tell  kernel.  */ 

•  64  r  =  new_mem(text_byles,dau_bytes,  bss  bytes.  stk_bytes,  tot_bytes, 

'  65  u.zb,  ZEROBUF_SIZE,  tmm,  AtflU  elk); 

►66  if  (r  =  EXSWAPD)  { 

-  67  swapout  =  TRUE; 

-68  }  else  if  (r  1=  OK)  { 

•  69  close(in_fd);                                /*  insufficient  core  or  program  too  big  */ 
►  70  retumfr); 

•  71  } 
72 

•  73  if(!  swapoui)  { 

74  /■  Patch  up  suck  and  copy  it  from  MM  to  new  core  image.  */ 

75  vsp  =  (vir_bytes)  (rmp->mp_seg[S].mem_vir  «  CLICK_SHIFT); 

76  vjp  +=  (virjjytes)  (rmp->mp_seg(S].mem_len  «  CUCK_SHJJT); 

77  vsp  -=  stk_bytes; 

78  patch_ptr(mbuf,  vsp); 

79  sre  =  (vir_bytes)  mbuf; 

80  r  =  mem_copy(MM_PROC_NR,  D,  (long)  sre,  who,  D,  (long)  vsp, 

81  (longj^tkbytes); 

82  if  (r  !=  OK)  panic("do_ejiec  suck  copy  err",  NO_NUM); 
83 

84  /*  Read  in  text  and  dau  segments.  */ 

85  load_seg(who,  in_fd,  T,  text_bytes); 

86  load_seg(who,  in_fd,  D,  daU_bytes); 

87  #ifdef  ATARI JiT 

88  if  (Ueek(in_fd,  sym_by.es,  1)  <  0) 

89  ;  /•  error  */ 
if  (rclocate(in_fd,  mbuf)  <  0) 

;  I*  error*/ 


/*  read  T  &  D  from  a.out  file  and  write  to  swap  device  •/ 

/*  create  swap  file  */ 

/*  change  to  swap  directory  */ 

tell_fs(CHDIR,  0,  2,  0); 

tmpmp  =  mp; 

mp  =  &mproc[MMPROC_NR); 

/•  get  swap  file  name  */ 

td  =  who; 

a  =  swap  name; 

whifc(sd)~{ 

••++  -  (id  %  10)  +  060; 

id/=  10; 


90 

91 

92 

«endif 

>  93 

}else{ 

>  94 

>  95 

>  96 

>  97 

>  98 

>  99 

>100 

>  101 

>  102 

>  103 

>  104 

>  105 

>  106 
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>  107  } 

>  108  •«  •  Ot 

>  109  outjd  =  aJlowed(swap_name,  &s_buf.  W_BIT);  /*  swapjtle  wrtblc?  •/ 

>  110  ■  ■  allowed(".-,  idjjuf.  WETT); 

>  111  mp  =  tmpmp; 

>  1 12  if  (outfd  >=  0)  close(oiit_fd); 

>  113  it  (^  >-  0)  closets;; 

>  114  if  (i  >=  0  &£  (outfd  >=  0  |  |  outjd  =  ENOENT))  { 

>  115  /*  File  is  writable  or  doesn't  exist  &  dir  is  writable  •/ 

>  116  outjd  =  creal(iwap_name,  SWAP_MODE); 
>117  }dse( 

>  118  tell_fs(CHDIR,  0,  1,  0);  /*  go  beck  to  MM '•  own  dir  */ 

>  119  retum(ERROR); 
>120  ) 

>  121  tell_fs(CHDIR,  0,  1,  0);  /•  go  back  to  MM'i  own  dir  */ 

>  122  if  (outfd  <  0)  rentmfERROR); 
>I23 

>  124  f*  change  existing  core  to  a  data  buffer  •/ 

>  125  /*  get  max  available  size  */ 

>  126  dbuflen  =  MIN(2047,  oldclk); 
>127 

>  128  r*  setup  data  buffer  */ 

>  129  rmp->mp_seg[D]jneinjjhys  =  rmp->mp  segIT].mem_phys; 

>  130  rmp->mp_seg[D]jnem_len  =  dbuf_lcn; 

>  131 

>  132  /•tell  kernel  about  the  buffer  •/ 

>  133  sys_newmap(who,  rmp->mp_seg,  TRUE); 

>  134 

>  135  swap_fd  =  (who  «S)|(D  «  6)  |  oul_fd; 

>  136  for(r=0;  K2;  r++)  { 

>  137  tlbytes  =  r  7  (Iong)data_bytes  :  (]ong)lextJ)ytes; 

>  138  xtrabytes  =  Oong) 

>  139  ((r  7  lmm[D]jnem_len  :  tmm[T).memJen)  «CUCK_SHIFT) 

>  140  -  ttbytes; 

>  141  while(ttbytes)  { 

>  142  loadbytes  =  (unsignedXMTN(  Gong)ubvtes, 

>  •«  (longXdbuf  Jen  «  CUCK_SHIFT)  )); 

>  144  /»  read  from  a.out  file  ♦/ 

>  145  load_seg(who,  in_fd,  D,  loadbytes); 

>  146  /*  write  it  to  swap  device  */ 

>  147  a  at  (char  ,Xrmp->mp_seg[DJ.mem_vir  «  CLICKSHrFT); 

>  148 

>  149  if(write(swap_fd,  a,  loadbytes)  !=  loadbytes)  { 

>  150  dose(in_fd); 

>  151  close(out_fd); 

>  152  panicCdo_exec  swap  device  write  err",  NO  NUM); 

>  153  } 

>  154  tlbytes  -=  (long)loadbytes; 

>  155  ) 

>  156  if(xtrabytes) 

>  157  /•  the  number  of  bytes  in  the  swap  file  for  each        */ 

>  158  /*  segment  must  be  equal  to  the  length  of  the  segment  •/ 

>  159  if(lseek(out_fd,  xtrabytes,  1)  <  0) 
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>  160  printfClteek  enoiO); 

>  161  ) 
>162 

>  163  f*  wrile  tuck  to  swap  device  */ 

>  164  /*  first  fix  mproc  memory  map  */ 

>  165  psrc  =  (char  *)Qnm; 

>  166  pdst  =  (char  *)mtp->mp_seg; 

>  167  whjlc(psrc  1=  ((char  *)tmm  +  si/eof(tmm))  ) 

>  168  •(pd!i++)  •  '(psm-H-); 

>  169  vsp  =  (vir_bytes)  (imp->mp_ieg[S]jnem_vir  «  CUCK_SHIFT); 

>  170  vtp  +=  (vir_bytes)  (rmp->rnp_seg[SJ.mem_len  «  CUCK_SHIFT); 

>  171  vsp  ■=  stkjjyles; 

>  172  patchjMifmbuf,  vsp); 

>  173  a  =  mbuf; 

>  174  if(lieek(otn_fd.  (longXvsp  -(lmm|S|.roem_vir  «  CUCK  SHIFD).l) 

>  175  <  (long)0) 

>  176  printfflseek  erroiO); 

>  177  if(wrile(out_fd,  a,  (unsigned)stk_bytcs)  !=  stk  byles)  { 

>  178  close(in_fd); 

>  179  close(t»it_fd); 

>  180  panic("do_exec  swap  device  write  err",  NO  NUM); 

>  181  } 

>  182  closetoutfd); 

>  183  fiee_mem(mip->mp_segP*]-mem_phys,  old_clk);  I*  free  the  memoiy  */ 

>  184  imp->mpjlagj  INSWAPPED;   /•  mark  mptoc  as  swapped*/ 

>  185  } 
186 

187  dose(in_fd);  /•  don't  need  exec  file  any  mote  */ 

188 

189  /•  Take  care  of  setuid/selgid  bits.  •/ 

190  if  (s_buf.st_mode  4  I_SET_UID_BrT)  ( 

191  rmp->mp_effuid  =  s_buf.st_uid; 

192  tell_fs(SETUID,  who,  (int)  rmp->mp_reajuid,  (int)  imp->mp  effuid); 

193  } 

194  if  (sbuf.stjnode  &  ISETGIDBrT)  ( 

195  rmp->mp_effgid  =  s_buf.st_gid; 

196  Iell_fs(SETGID.  who,  (int)  rmp->mp  realgid,  (int)  mip->mp_effgid); 

197  ) 
198 

199  I*  Fa  up  aome  'mproc'  fields  and  tell  kernel  that  exec  is  done.  •/ 

>  200  rmp->mp_dcadchi]d  =  0;  /*  reset  swap  wait  */ 

>  201  rmp->mp_ssw_map  =  0;  /*  reset  all  swap  signals  */ 

202  imp->mp_calcfi  =  0;  /*  reset  all  caught  signals  */ 

203  rmp->mp_flags  &=  SEPARATE;  /•  mm  off  SEPARATE  bit  •/ 

204  rmp->mp_flags  |=  ft;  /*  mm  it  on  for  separate  I  &  D  files  */ 
205 

206  new_sp  =  (char  *)  vsp; 

>  207  sys_exec(who,  new_sp,  swapoot); 

208  retum(OK); 

209  ) 
210 

211 

212  f 
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213 
214 


215  PRIVATE  int  new_mem(text_bytes,  dala_bytes,  bss_bytes,  ilk  bytes, 

216  lotbytes.  bf,  zs,  tmm,  oldclk) 

217  vir_bytes  text_bytes;  /*  text  icgmenl  size  in  bytes  */ 

218  vir_bytes  data_bytes;  /*  size  of  initiilized  data  in  bytes  */ 

219  virjjytes  bssbytes;  /*  size  of  bss  in  bytes  •/ 

220  vir_bytes  stk_bytes;  /*  size  of  initial  suck  segment  in  bytes  */ 

221  phyi_bytes  tot_byies;  /*  total  memory  to  allocate,  including  gap  */ 

222  char  bf[ZEROBUF_SlZE];  /•  buffer  to  use  for  zeroing  data  segment  */ 

223  int  zs;  ^true  size  of  'br  */ 

224  struct  mem  map  tmm [];  /*  temporary  memory  map  */ 

225  phys_clicks~*old_dk;  /*  #  of  clicks  in  old  process  */ 

226  { 

227  /*  Allocate  new  memory  and  release  the  old  memory.  Change  the  map  and  report 

228  *  the  new  map  to  the  kernel.  Zero  the  new  core  image's  bss,  gap  and  suck. 

229  */ 
230 

231  register  struct  mproc  *rmp; 

232  vir_clicks  text_clicks,  daU_clicks,  gap_cticks,  stack_clicks,  toi_clicks; 

233  phys_clicks  new_base; 

234  extern  phys_dicks  alloc_mem0; 

235  extern  phys_clicks  tot  holeO; 

236  #ifdef  ATARIST 

237  phys_clicks  base,  size; 

238  #else 

239  char  "rzp; 

240  vir_bytes  vzb; 

241  phys_clicks  old  clicks; 

242  phys_bytes  bytes,  base,  count,  bss_offset; 

243  #endif~ 
244 

245  f  Acquire  the  new  memory.  Each  of  the  4  parts:  text,  (data+bss),  gap, 

246  *  and  stack  occupies  an  integral  number  of  clicks,  starting  at  click 

247  *  boundary.  The  data  and  bss  parts  are  run  together  with  no  space. 

248  •/ 
249 

250  textclicks  =  (text_bytes  +  CL1CK_SEE  -  1)  »  CLICK_SHIFT; 

251  data_clicks  =  (daubytes  +  bss_bytes  +  CLICK  SIZE  -  1)  »  CLICK  SHUT; 

252  stackclicks  =  (stkbytes  +  CLICKSIZE  -  1)  >>  CUCKSrUFT; 

253  lotclicks  =  (tot_bytes  +  CUCKSIZE  -  1)  »  CUCK_SHIFT; 

254  gap_clicks  =  lotclicks  -  data_clicks  -  stackclicks; 

255  if  (  (int)  gap  clicks  <  0)  retum(ENOMEM); 
256 

257  imp  ■  mp; 

258  #ifndef  ATARIST 

259  oldclicks  =  (phys_clicks)  nnp->mp_ieg[Sljnem_len; 

260  old_clicks  +=  (rmp->mp_ieg[S].mem_vir  -  rmp->mp  seg[D].mem  vir); 

261  if  (rmp->mp_flags  &  SEPARATE)  oldclicks  +=  rmp->mp_seg[T].mem  len; 

262  #endif 
263 

264  /*  Check  to  see  if  there  is  a  hole  big  enough.  If  so,  we  can  risk  first 

265  *  releasing  the  old  core  image  before  allocating  the  new  one,  since  we 
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266  *  know  ii  will  succeed.  If  there  is  not  enough,  return  failure. 

267  */ 

>  268  if(  (text_dicks  +  tot_clicks)  >  (tot_hole()  +  oldclicks)  )  { 

>  269  I*  core  is  not  available,  must  swap  proc  out,  first  get  new  map  */ 

>  270  tmm[T]-mem_len  =  text_clicks; 

>  271  tmm[T].mern_phys  =  nnp->mp_segrT].mem_phys; 

>  272  tmm[D].mem_len  =  data  clicks; 

>  273  tmm[D]jnemj>hys  =  tram(T]jneni_phys  +  text  clicks; 

>  274  tmm[S]jnem_len  =  itackclickj; 

>  275  tmm[S].mem_phys  =  tmm[D].mem_phys  +  dalaclicks  +  gap_clicks; 

>  276  tmm[T]jnem_vir  =  0; 
>277  tmmp]jnem_vir  =  0; 

>  278  tmm[S].mem_vir  =  tmm[D]jnem_vir  +  data_clicks  +  gap_clicks; 

>  279  *old_clk  =  ddclicks; 

>  280  retumfEXSWAPD); 

>  281  } 
282 

283  #ifndef  ATAR1_ST 

284  /*  There  is  enough  memory  for  the  new  core  image.  Release  the  old  one.  */ 

285  free_mem(rmp->mp_segm-men\phys,  old_clicks);    /*  free  the  memory  */ 

286  #endif 
287 

288  I*  We  have  now  passed  the  point  of  no  return.   The  old  core  image  has  been 

289  *  forever  lost.   The  call  must  go  through  now.   Set  up  and  report  new  map. 

290  */ 

291  newbase  =  alloc_mem(iext_clicks  +  tolclicks);        /*  new  core  image  */ 

292  if  (new  base  =  NO_MEM)"panicCMM  hole  list  is  inconsistent",  NO_NUM); 

293  rmp->mp_seg[T].mem_]en  *  text  clicks; 

294  rmp->mp_seg[T].mem_phys  =  new_base; 

295  rmp->mp_seg[D).mem_len  =  data_clicks; 

296  rmp->mp_seg[D].mem_phys  =  new_base  +  text_clicks; 

297  imp->mp_seg[S].mem_len  =  stack  clicks; 

298  rmp->mp_seg[S].mem_phys  ■  nnpomp  seg[D].mem_phys  +  data_clicks  +  gap  clicks; 

299  #ifdef  ATARI_ST 

300  rmp->mp_seg[T].mem_vir  =  imp->mp_seg[T]tnem_phys; 

301  rmp->mp_seg[D].mem_vir  =  rmp->mp_seg[D].mem_phys; 

302  imp->mp_5eg[S].mem  vir=  imp->rnp_seg[S].mem_phys; 

303  #else 

304  rmp->mp_seg[T].mem_vir  =  0; 

305  rmp->mp_seg[D].mem  vir  =  0; 

306  rmp->mp  seg[S].mem_vir  =  rmp->mp_segrD].mem_vir  +  dau_clicks  +  gap_clicks; 

307  #endif 

308  #ifdef  ATARIST 

309  iyi_fresh(who,  rmp->mp_seg,  (phys_clickiXdau_bytes  »  CLICK_SHIFT), 

310  Abase,  Asize); 

311  free_mem(base,  size); 

312  #else 

313  fyijiewmapfwho,  nnp->mp  <eg,  FALSE);  /•  report  new  map  to  the  kernel  V 
314 

315  /•  Zero  the  bss,  gap,  and  stack  segment.  Start  just  above  text.   •/ 

316  for  (rzp  =  AbflO];  rzp  <  Abflzs];  rzp++)  "rzp  =  0;      /*  dear  buffer  */ 

317  bytes  =  (phys  bytes)  (data_dicks  +  gap  clicks  +  stack_clicks)  «  CLICK  SHIFT; 

318  vzb  =  (virbytes)  bf; 
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319  base  =  (long)  rmp->mp_seg[T]  jnem_phys  +  nnp->mp_teg[T]mem  len; 

320  base  =  baie  «  CLICK  SHIFT; 

321  bss_offset  =  (databytes  »  CUCK_SHIFT)  «  CUCK_SHIFT; 

322  base  +=  bss_offsei; 

323  byles  -=  bssjaffset; 
324 

325  while  (bytes  >  0)  { 

326  count  =  (long)  MIN(bytei,  (phys_bytes)  zs); 

327  if  (mem_copy(MM_PROCNR,  D,  (long)  vzb,  ABS.  0.  base,  count)  !=  OK) 

328  panicCnewjnem  can't  zero**,  NONUM); 

329  base  +=  count; 

330  bytes  -=  count; 

331  } 

332  #endif 

333  retum(OK); 

334  } 
335 

336 

337        r 


338  *  load  seg 

339  * 


>  340         PUBLIC  load_seg(usr,  fd,  seg,  segbytes) 

>  341         int  usr,  /*  user  slot  in  proc  table  */ 

342  int  fd;  /*  file  descriptor  to  read  from  */ 

343  int  seg;  /•  T  or  D  V 

344  virbytes  segbytes;  /*  how  big  is  the  segment  */ 

345  { 

346  /*  Read  in  text  or  dam  from  the  exec  file  and  copy  to  the  new  core  image. 

347  •  This  procedure  is  a  little  bit  tricky.  The  logical  way  to  load  a  segment 

348  *  would  be  to  read  it  block  by  block  and  copy  each  block  to  the  user  space 

349  *  one  at  a  time.   This  is  too  slow,  so  we  do  something  dirty  here,  namely 

350  *  send  the  user  space  and  virtual  address  to  the  file  system  in  the  upper 

351  *  10  bits  of  the  file  descriptor,  and  pass  it  the  user  virtual  address 

352  *  instead  of  a  MM  address.  The  file  system  copies  the  whole  segment 

353  *  directly  to  user  space,  bypassing  MM  completely. 

354  */ 
355 

356  int  new_fd,  bytes; 

357  char  *ubuf_ptr, 

358  struct  mproc  *rmp; 
359 

>  360  new_fd  =  (usr  «  8)  |  (seg  «  6)  |  fd; 

>  361  rmp  =  &mproc[usr]; 

362  ubufjHr  =  (char  *)  ((vir_byies)rmp->mp_seg[seg]jnem_vir  «  CUCK_SrflFT); 

363  while  (seg  bytes)  { 

364  bytes  =  31*1024;  /•<=  32767  •/ 

365  if  (segbytes  <  bytes) 

366  bytes  ■  (int)seg_bytes; 

367  if  (read(new_fd,  ubuf_ptr,  bytes)  1=  bytes)  { 

368  panicCloadseg  read  err",  NO_NUM); 

369  break;  /*  error  */ 

370  } 

371  ubuf_ptr  +=  bytes; 
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372  ieg_bytes  -=  bytes; 

373  ) 

374  } 
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1  /» 

2  •  do  foil 


4  PUBLIC  int  doforkO 

5  { 

6  /*  The  process  pointed  to  by  'mp'  his  forked.  Create  •  child  process.  •/ 
7 

8  register  struct  mproc  •rmp;       /*  pointer  to  parent  •/ 

9  register  struct  mproc  *rmc;        /*  pointer  to  child  V 

10  int  L  child_nr,  I; 

11  char  *«ptr,  *dptr, 

12  phys  clicks  prog_dicks,  child J»se; 

13  extern  phys_clicks  a]loc_mem0; 

14  extern  phys_dicks  Iot_hc4c0; 

15  int  swapout  =  FALSE; 

16  vir_bytes  new_sp; 

17  flfndef  ATARIST 

18  long  prog_bytes; 

19  long  parent_abs,  child_abs; 

20  #endif 
21 

22  r  If  ubles  might  fill  up  during  FORK,  don't  even  start  since  recovery  half 

23  *  way  through  is  such  a  nuisance. 

24  •/ 
25 

26  rmp  =  mp; 

27  if  (procsjnjise  =  NRPROCS)  retumfEAGAIN); 

28  if  (procsjnjise  >=  NR_PROCS  -  LASTFEW  4*  rmp->mp  effuid  1=  0)rctum(EAGATN)- 
29 

30  /*  Determine  how  much  memory  to  allocate.  */ 

31  prog_cIicks  =  (phys_clicks)  rmp->mp_scg[S].mem_len; 

32  prog_dicks  +=  (rmp->mp_seg[S).mem_vir  -  rmpomp_seg[D].mem_vir); 

33  (rUhdef  ATARIST 

34  if  (imp->mp_nags  &  SEPARATE)  progdicks  +=  rmp->mp_segrT].memJen; 

35  prog_bytes  =  0ong)  prog_clicks  «  CLICK_SHIFT; 

36  *endif 
37 

>  38  if  (  (prog_click5  >  tot_hole0)  I  1 

>  39  ( (childbase  =  aUoc_mem(prog_clicks»  =  NO  MEM) )  { 

>  40  swapout  =  TRUE; 

>  41  I*  adjust  parents  memory  map,  if  necessary  */ 

>  42  fys_getsp(who,  &new_sp); 

>  43  if(adjust(rmp,  (vir_clicks)  rmp->mp_seg[Dl.mem  len,  new_sp)  1=  OK) 

>  44  retumfEAGAIN); 

>  45  ) 
46 

47  #ifndef  ATARI_ST 

>  48  if(l  swapout)  { 

49  /•  Create  a  copy  of  the  parent's  core  image  for  the  child.  */ 

50  childabs  -  (long)  child  base  «  CUCKSHTFT; 

51  parentabs  =  (long)  rmp->mp_seg|Tlmem_plrys  «  CLICKSHTFT; 

52  i  =  mem_copy(ABS,  0,  parent  abs.  ABS,  0,  child  abs,  prog_bytes); 

53  if  (  i  <  0)  panicCdo_fork  can't  copy",  i); 
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54  } 

55  #endif 
56 

57  /•  Find  a  slot  in  'mproc'  for  the  child  process.   A  slot  must  exist.  */ 

58  for  (rmc  =  &mproc[0];  rmc  <  AmprocrNR_PROCS];  rmc++) 

59  if  (  (nnc->mp  flags  &  INUSE)  =  0)  break; 
60 

61  /*  Set  up  the  child  and  its  memory  map;  copy  its  'mproc'  slot  from  parent.  */ 

62  child_nr  =  (intXmic  -  mproc);  /*  slot  number  of  the  child  */ 

63  aplr  =  (char  *)  imp;  f*  pointer  to  parent's  'mproc'  slot  •/ 

64  dptr  =  (char  *)  rmc;  /•  pointer  to  child's 'mproc*  slot  •/ 

65  i  =  sizeof(struct  mproc);  /*  number  of  bytes  in  a  proc  slot.  •/ 

66  while  (i~)  •dptr++  =  *sptr++;/*  copy  from  parent  slot  to  child's  */ 
67 

68  imc->mp_parent  =  who;  /*  record  child's  parent  */ 

69  #ifndef  ATAR1ST 

>  70  if(]  swapout)  ( 

71  rmc->mp_seg[Tj.mem_phys  =  child  base; 

72  rmc->mp_seg[D].mem_phys  =  child_base  +  rmc->mp_seg[T].memJen; 

73  nnc->mp_seg[S).mem_phys  =  rmc->mp_seg[D].mem_phys  + 

74  (rmp->mp_seg[S].mem_phys  •  rmp->mp  seg[D].mem_phys); 

>  75  )  else  ( 

>  76  I*  swapout  parent's  image  for  child,  don't  free  parent's  core  •/ 

>  77  if(  swap_out(child_nr,  rmc,  rmp.  FALSE)  1=  OK)  { 

>  78  retumfEAGATN); 

>  79  )  else  { 

>  80  rmc->mp  flags  |=  FKSWAPPED; 

>  81  ) 

>  82  ) 

83  ffendif 

84  rmc->mp_exilstalus  =  0; 

85  rmc->mp_sigslatus  =  0; 

>  86  rmc->mp_deadchild  =  0;  I*  reset  swap  wait  */ 

>  87  rmc->mp_ssw_map  =  0;  I*  reset  all  swap  signals  */ 
88             procs  in_use++; 

89 

90  /*  Find  a  free  pid  for  the  child  and  put  it  in  the  table.  */ 

91  do  < 

92  1  =  0;  /•  'i'  =  0  means  pid  still  free  */ 

93  nextjjid  =  (nextjjid  <  30000  7  next_pid  +  1  :  ENTrPROCNR  +  1); 

94  for  (rmp  =  &mproc[0);  rmp  <  &mprocrNR_PROCSl;  nnp++) 

95  if  (rmp->mp_pid  =  next_pid  1 1  rmp->mp_procgrp  =  next  pid)  { 

96  t.l; 

97  break; 

98  } 

99  rmc->mp_pid  =  nexl_pid;  /•  assign  pid  to  child  */ 
100            )  while  (1); 

101 

102  I*  Set  process  group.  */ 

103  if  (who  =  rNTTPROCNR)  rmc->tnp_procgrp  =  rmc->mp_pid; 
104 

105  /*  Tell  kernel  and  file  system  about  the  (now  successful)  FORK.  •/ 

106  #ifdef  ATARI  ST 
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107  sys_fork(who,  childnr,  imc->mp_pid,  child  base); 

108  #else 

>  109  lys_fork(who,  child_nr,  rmc->mp_pid,  swapoul); 
110         (toidif 

111 

1 12  tell_fs(FORK,  who,  childnr,  rmc->mp_pid); 

113 

114  tHfndef  ATARIST 

115  /•  Repoit  child's  memory  map  to  kernel.  •/ 

116  if(!  iwapout)  { 

117  iys_newmap(child_nr,  rrnc->fnp_»eg,  FALSE); 

118  } 

119  «a>dif 
120 

>  121  tf(!  swapoul)  { 

122  /•  Reply  lo  child  to  wake  it  up.  */ 

123  reply(child_nr,  0,  0,  NH.PTR); 
>124           ) 

125  retum(next_pid);  /*  child's  pid  */ 

126  ) 
127 


128 

129  /• 

130  *  do_mm_exit 

131  ♦ 


132  PUBLIC  int  do_mm_exit0 

133  { 

134  /*  Perform  the  exit(status)  system  call.  The  real  work  is  done  by  mm  exitO, 

135  *  which  is  also  called  when  a  process  is  killed  by  a  signal. 

136  */ 
137 

138  mm_e:ul(mp,  status); 

139  dont_reply  =  TRUE;  I*  don't  reply  lo  newly  tetminated  process  */ 

140  rcuimfOK);  /•  pro  forma  return  code  */ 

141  } 
142 

143 

144        r 


145  *  mm_exil 

146  • 


147  PUBLIC  mm_exit(rmp,  exit_sutus) 

148  register  struct  mproc  *rrnp;  /*  pointer  to  the  process  to  be  terminated  • 

149  int  exit_status;  /*  the  process'  exit  status  (for  parent)  */ 

150  { 

151  /*  A  process  is  done  If  parent  is  waiting  for  it,  clean  it  up.  else  hang.  */ 

152  #ifdef  ATARIST 

153  phys  clicks  base,  size; 

154  #endif 

155  phys_clicks  s; 

156  register  int  proc_nr  =  (intXrmp  -  mproc); 
157 

158  /*  How  to  termixtate  a  process  is  determined  by  whether  or  not  the 

159  *  parent  process  has  already  done  a  WATT.  Test  to  see  if  it  has. 
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160  •/ 

161  imp->mp_eMlstatus  =  (char)  exit_iuuus;    /*  store  status  in  'mptoc'  */ 
162 

163  if  (mprocfrmp  >mp  parent]  mp  Hags  A  WAITING)  ( 

>  164  if(mproc[nnp->mp_parenil  mpjags  A  (SWAPPED  |  FKSWAPPED))  { 

>  16S  /*  parent  ii  swapped  A  wailing,  delay  cleanup  by  falsely  */ 

>  166  r  marking  child  HANGING,  telling  deadchild,  and  telling  */ 

>  167  r  SWAP_TASK  •/ 

>  168  rmp->mp  flags  |=  HANGING; 

>  169  mproc[imp->tnp_parent]jnp_dea<lchild  at  proc_nr, 

>  170  mproc[rmp->mp_parent].mp  flags  A=   WATTING; 

>  171  ieply(SWAP_TASK.  CORE_IS_NEEDED. 

>  172  (iru)(Amprc«|rrnp->rnp_parenlj  -  mprcc),  NIL_PTR); 

>  173  }  else 

174  cleanup(rmp);  /•  release  parent  and  tell  everybody  */ 

175  }elie 

176  rmp->mp  flags  |  =  HANGING;   /•  Parent  not  waiting.   Suspend  proc  •/ 
177 

178  /"*  It  the  exited  process  has  a  tinier  pending,  kill  it   */ 

179  if  (rmposnp  flags  A  ALARM_ON)  set_alamt(proc_nr,  (unsigned)  0); 
180 

181  #ifdef  AMKERNEL 

182  /*  see  if  an  amoeba  transaction  was  pending  or  a  putrep  needed  to  be  done  •/ 

183  am_cbeck_sig(proc_nr,  1); 

184  «endif 
185 

186  r  Tell  the  kernel  and  FS  that  the  process  is  no  longer  runnable.  */ 

187  Kifdef  ATARIST 

188  sysjrit(mip->mp_jjarent,  proc_nr.  Abase,  Asize); 

189  free  memfbase.  size); 

190  #else 

191  sy«jrit(rmp->mp_parent,  proc_nr); 

192  #endif 

193  tell_fs(EXiT,  proc_nr,  0,  0);  /*  file  system  can  free  the  proc  slot  */ 
194 

195  #ifndcf  ATARIST 

196  t*  Release  the  memory  occupied  by  the  child.  •/ 

197  s  =  (phys_clicks)  rmp->mp_scg[S].mcm_len; 

198  s  +=  (rmp->mp_seg[S].mem_vir  •  rmp->mp_seg[D].mem_vir); 

199  if  (rmp->rnp_flags  A  SEPARATE)  s  +=  nnp->mp_segrT).mem  len; 

200  free_mem(rmp->mp_seg[T].man_phys,  s);  /*  free  the  memory  •/ 

201  Hendif 
202 

203  ) 

204 

205 

206        r 


207  •  do  wan 

208  » 


209  PUBUC  int  dowaitO 

210  { 

21 1  I*  A  process  wanu  to  wail  for  a  child  to  terminate.  If  one  is  already  waiting, 

212  *  go  dean  it  up  and  let  this  WATT  call  terminate    Otherwise,  really  wait. 
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213  •/ 

214 

215  register  struct  mproc  *rp; 

216  register  int  children; 
217 

218  /*  A  process  calling  WAIT  never  gels  a  reply  in  the  usual  way  via  the 

219  *  replyO  in  the  main  loop.    If  a  child  has  already  exiled,  the  routine 

220  *  cleanupO  sends  the  reply  to  awaken  the  caller. 

221  •/ 
222 

223  f*  Is  there  a  child  waiting  to  be  collected?  •/ 

224  children  •  0; 

>  225  for  (rp  =  4mproc[0];  rp  <  &mproc(NJt_PROCSl;  rp++)  { 

>  226  if  (  (rp->mp_nags  4  INJJSE)  44  rp->mp_parenl  =  who)  { 

>  227  children-* 

>  228  if  (rp->mp_nags  4  HANGING)  ( 

>  229  deanup(rp);  t*  a  child  has  already  exited  */ 

>  230  dontreply  -  TRUE; 

>  231  retum(OK); 
>232                                        ) 

>233  ) 

>234  } 

235 

236  I*  No  child  has  exiled.  Wait  for  one,  unless  none  exists.  */ 

237  if  (children  >  0)  {  /*  does  this  process  have  any  children?  •/ 

238  mp->mp_flags  |  =  WATTING; 

239  don!  reply  ■  TRUE; 
240 

241  for  (rp  =  AmprocHNITPROCNR  +  1);  rp  <  4mprocrNR  PROCS];  rp++  )  ( 

242  if  (  (rp->mp  Hags  4  INJJSE)  =  0)  continue; 

243  if(  (rp-iropjUgs  4  (SWAPPED  |  FKSWAPPED))  44 

244  (rp->mp_nags  4  (PAUSED  |  WAITING)  =  0)  )  ( 

245  ms.m_source  =  MM  PROC  NR; 

246  msjn  type  =  CORE_IS_FREE; 

247  send(SWAP  TASK.  4ms); 

248  break; 

249  ) 

250  ) 

251  retum(OK);  /*  yes  -  wait  for  one  to  exit  */ 

252  }else 

253  retumfECHILD);  f  no  -  parent  has  no  chUdren  •/ 

254  ) 
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4  PUBLIC  mainO 

5  ( 

6  I*  Main  routine  of  the  memory  manager.  */ 
7 

8  int  error, 

9 

10  mm_init();  I*  initialize  memory  manager  taNei  •/ 

11 

12  /*  Thil  is  MM'i  main  loop-  gel  work  and  do  il,  forever  and  forever.  V 

13  while  (TRUE)  { 

14  r  Wail  for  message  •/ 

15  get_work0;  /*  wait  for  an  MM  system  call  •/ 

16  mp  =  &mproc[who]; 
17 

18  r*  Set  some  flags.  */ 

19  error  =  OK; 

20  dontreply  •  FALSE; 

21  err  code  =  -999; 
22 

23  /•  If  the  call  number  is  valid,  perform  the  call.  •/ 

24  if  (mm_call  <  0  1 1  mm_call  >=  NCALLS) 

25  error  =  E_BAD_CALL; 

26  else 

>  27  error  =  ('call  veclmm_cal]]X0); 

28 

29  /•  Send  the  results  back  to  the  user  to  indicate  completion.  •/ 

30  if  (dontreply)  continue;  T  no  reply  for  EXIT  and  WAIT  •/ 

3 1  if  (mmcall  =  EXEC  &  A  error  =  OK)  continue; 

32  reply(who,  error,  result2,  res  jtr); 

33  } 

34  ) 
35 

3o 
37 
38 


39  *  reply 

40 


41  PUBLIC  rcplytprocjiT,  result,  res2,  respt) 

42  int  procjtr,  /*  process  to  reply  to  •/ 

43  int  result;  /*  result  of  the  call  (usually  OK  or  error  #)*/ 

44  int  res2;  /"  secondary  result  */ 

45  char  *respl;  I*  result  if  pointer  •/ 

«  1 

47  /*  Send  ■  reply  to  t  user  process.  */ 
48 

49  register  struct  mproc  *proc_ptr, 
50 

51  t*  To  mike  MM  robust,  check  to  see  if  desdnation  ii  still  ilive.  */ 

>  52  if(proc_nr  1=  SWAP_TASK)  { 

53  proc_ptr  =  &mproc[proc_iir]; 
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54  if  ( (proejxrsmpJIigs&rNJJSE)  =  0  1 1  (r*oc_rxr.>mpJUg>&HANGING)) 

55  return; 

56  } 

57  reply_type  =  result; 

58  rcplyjl  =  n»2; 

59  reply  _pl  =  respi; 

60  if  (Mjid(proc_nr.  &mm_out)  !=  OK)  puiicfMM  can't  reply",  NO_NUM); 

61  } 
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>  9 

>  10 

>  11 

>  12 

>  13 

>  14 

>  15 

>  16 

>  17 

>  18 

>  19 

>  20 

>  21 

>  22 

>  23 

>  24 

>  25 

>  26 

>  27 

>  28 

>  29 

>  30 

>  31 

>  32 

>  33 

>  34 

>  35 

>  36 

>  37 

>  38 

>  39 

>  40 

>  41 

>  42 

>  43 

>  44 

>  45 

>  46 

>  47 

>  48 

>  49 

>  50 

>  51 

>  52 

>  53 


^include 

♦include 
♦include 
♦include 
♦include 
♦include 
#indude 
♦include 
♦  include 
♦indude 
#indude 


"../h/const-h" 

"../Vtypejr 

".-/h/caHnr-h" 

".,/h/com.h" 

".-/h/error.h" 

".Vh/staLh" 

".Vh/signal.h" 

"ccnsLh" 

"glo.h" 

"mproch" 

"pa ram  h" 


♦define  LONG1 


PRIVATE  siruci  mproc  *rmp; 


/*  message  slot  to  cany  long  bitmap  of  */ 
/*  pause/wail  procs,  if  proc  table  has   •/ 
F  more  than  32  slots,  another  method  is*/ 
I*  needed  */ 


do  swout 


do_swout(num) 
int  num. 

{ 

/*  peifoim  request  from  SWAPTASK  to  do  swapout  of  i  particular  process 


mi  pnum; 


mp  =  &mproc[MM_PROC_NR);  f  mp  points  to  MM  •/ 

if(num) 

pnum  =  num. 
else 

pnum  =  mm_in.PROCl; 
if(  (pnum  <  0)  |  |  (pnum  >NR_PROCS)  )  { 

printfC'DSOl:  swapout  proc  out  of  range:  %d0,pnum); 

rttumfERROR); 
} 


rmp  =  &mproc[pnum]; 


/•  rmp  points  to  swapout  proc  */ 


tf(swap_out(pnum,  imp.  rmp.  TRUE)  =  OK)  { 
rmp->mp_fiags  INSWAPPED; 
dont_reply=  TRUE; 

mmouLmlype  =  SWAPOUT COMPL; 
mm_ouLPROCl  =  pnum; 
if(rmp->mp_flags  4  (PAUSED  |  WATTING)) 

mm_out.PROC2  =  TRUE; 
else 

mm__ouLPROC2  >  FALSE; 
if  (send(SWAP_TASK,&mm_out)  !=  OK) 

panicCmswap  can't  lend  mes",  NO_NUM); 
retum(SWAP_OUT_COMPL); 
}else{ 
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> 

54 

> 

55 

> 

56 

> 

57 

> 

58 

> 

59 

> 

60 

> 

61 

> 

62 

> 

63 

> 

64 

> 

65 

> 

66 

> 

67 

> 

68 

> 

69 

> 

70 

> 

71 

> 

72 

> 

73 

> 

74 

> 

75 

> 

76 

> 

77 

> 

78 

> 

79 

> 

80 

> 

81 

> 

82 

> 

83 

> 

84 

> 

85 

> 

86 

> 

87 

> 

88 

> 

89 

> 

90 

> 

91 

> 

92 

> 

93 

> 

94 

> 

95 

> 

96 

> 

97 

> 

98 

> 

99 

> 

100 

> 

101 

> 

102 

> 

103 

> 

104 

> 

105 

> 

106 

ICSUlt2  =  pnum; 
rctum(SWAP_OUT_FAILED); 


■wap_out 


■wap_out(iwap_proc,  nnc,  imp,  clear_mem) 

int  fwap_proc; 

■tract  mproc  *rmc;      f*  child  process  */ 

■tract  mproc  "imp;     /*  parent  proceis  */ 

int  dew  nan; 

{ 

I*  do  actual  work  of  swapping  out  the  process  image  to  swap  device  */ 

phys_clicks  s; 

int  type; 

■tract  mproc  *lmp; 

tmp  =  mp; 

mp  =  &mproc[MM_PROC_NR); 

if(clear_mem)  type  =  1; 

else  type  =  2; 

if(dump_core(rmc,  imp,  type)  !=  OK)  { 

printffSWAPOUT  ERROR0); 

mp  =  tmp; 

return  (ERROR); 
> 

mp  =  tmp; 
if(clear_mem)  { 

/*  Release  the  memory  occupied  by  the  process.  */ 

s  =  (phys_clicks)  rmp->mp  ieg[S].mem_len; 

s  +=  (rmp->mp_seg[S|.mem_vir  -  fmp->mp_seg[D]jnem_vir); 

if  (rmp->mp_flags  &.  SEPARATE)  s  +=  imp->mp  seg[T].mem_len; 

free_mem(nnp->mp_seg['n.mem_phys,  s);  f*  free  the  memory  */ 

} 
retum(OK); 


*  do_iwin 

do_iwin0 

< 

f*  perfoim  request  from  SWAP_TASK  to  do  twapin  of  a  particular  process 
*/ 

int  i,  num. 

phy>_dicks  new_base,  tot_clicks; 
extern  phy»_clicki  alloc_mem0,  tot_hole(); 
extern  cleanupO; 
■tract  mproc  *ipp; 
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>  107         long  bitmap; 

>  108 

>  109  num  =  mmjn.PROCl; 

>  1 10  mp  =  &mproc[MM_PROC_NR];                 /•  mp  poinu  to  MM  •/ 

>  111  imp  =  &mproc|num]./"  imp  points  to  iwipin  process  */ 

>  112  tot_clicki  =  rmp->mp_seg[Sl.mem_phys  • 

>  113  imp->mp_ieg[Tljnem_phyi  + 

>  114  rmp->mp_seg[S).mem_len; 

>  1 15  if(  (uxhoJeO  >=  tot_dicki)  44 

>  116  ((newbase  ■  allc*jnem(toi_click<))  1=  NO_MEM) )  { 

>  117  imp->mpjf»gj  4=  SWAPPED; 

>  118 

>  119  if(swap_in(nurn,  imp,  new_base)  —  OK)  { 

>  120  if(nnp->mp_nigi  4  FKSWAPPED)  ( 

>  121  mip->mp_flags  4=  FKSWAPPED; 

>  122  /»  wake  up  forked  child  'I 

>  123  reply(num,  0.  0.  NTLPTR); 

>  124  ) 

>  125  /*  do  all  processing  required  due  to     •/ 

>  126  /*  proc  being  swapped  out,  messy  stuff  •/ 

>  127  I*  process  signals  recvd  during  swapout  */ 

>  128  if(rmp->mp  ssw_map)  { 

>  129  MM;  i<=NR_SIGS;  i-H-)  { 

>  130  if(rmp->mp_ssw_map  4  (1  «  i  -1)) 

>  131  sig_proc(rmp,  i); 

>  132  } 

>  133  imp->mp_ssw_map  =  0; 

>  134  ) 

>  135  /"  if  proc  was  WATT  4  SWAPPED  4  a  child  died  */ 

>  136  if(rmp->mp_deadchUd)  { 

>  137  cleanup(4mproc(nnp->mp  deadchild]); 

>  138  !mp->mp_deadchi]d  =  0; 

>  139  } 

>  140  /"  wake  up  P/W  proc  awakened  by  signal  while  SWAPPED  •/ 

>  141  if(rmp->mp_ftags  4  WASPWS)  { 

>  142  replyfnum,  HNTR,  0,  NIL_PTR); 

>  143  tmp->mp  flags  4=  WASPWS; 

>  144  } 

>  145  iesult2  =  num; 

>  146  retnmlSWAP  IN  COMPL); 

>  147  )  else  { 

>  148  panicrSWAPDV  ERROR"  ,NO_NUM); 

>  149  } 

>  150  }  else  { 

>  151  /•  no  core  available,  lend  kernel  list  of  PAUSED  4  •/ 

>  152  /•  WATTING  proa  and  amount  of  core  needed  •/ 

>  153  imp->mp_nags  |  =  SWAPPED; 

>  154  bitmap  =  0; 

>  155  fonjpp  •  4mprociINrT_PR0C_NR  +11; 

>  156  rpp  <  4mprocrNR_PROCS];  ipp++)  { 

>  157  if(  ((rpp->rap_nagi  4  TN_USE)  =  0)  1 1 

>  158  (rpp->mp_rlags  4  (HANGING  |  SWAPPED  I  FKSWAPPED)) ) 

>  159  continue; 
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>  160  n*(ipp->mp_flig!  4  (PAUSED  |  WAITING)) 

>  161  bitmap  |  =  1«  (inl)(ipp  -mproc); 

>  162  } 

>  163  mmoounjype  •  SWAPJN_F  AILED; 

>  164  mm_ouLPROCl  =  (uniignedXtot_clicks  •  tol_noleO); 

>  165  mm_ouLLONGl  =  bitmap; 

>  166  if  (sendrec(SWAPjrASKArnm_out)  la  OK) 

>  167  panicfmswap  can't  lend  mes",  NO_NUM); 

>  168  if(mm_out.rn_type  1=  SWAP_OUT_REQ)  { 

>  169  "      donl  reply  =  TRUE; 

>  170  ret»m(SWAPJN_FAILED); 

>  171  )  else  { 

>  172  rentm(do_swaui(mm  ooLPROCl)); 

>  173  ) 
>174                          ) 

>  175         ) 

>  176 

>  177 

>  178        /•  = 

>  179         * 

>  180         » 


swapin 


>  181  PUBLIC  int  swap_in(swap_proc,  rmp,  new_base) 

>  182  int  swap_proc; 

>  183  struct  mproc  'rmp; 

>  184  phys_clicks  new_base; 

>  185  1 

>  186  I*  do  actual  work  of  swapping  in  the  process  image  from  swap  device  "/ 
>187 

>  188  int  fd,  tmp; 

>  189  char  name_bufl5];    /*  the  name  of  the  file  to  swapin  •/ 

>  190  char  zb[ZEROBUF_SIZE];       /•  used  to  zero  bss  •/ 

>  191  struct  sut  s_buf; 

>  192  phys_dicks  gap_cliclcs; 

>  193  extern  int  load_seg0; 

>  194  char  *rzp; 

>  195  vir_bytes  vzb; 

>  196  phys_bytcs  bytes,  base,  count; 

>  197 

>  198  /•  Do  some  validity  checks.  */ 
>199 

>  200  r  Get  the  swap  file  name  •/ 

>  201  rzp  =  namebuf; 

>  202  tmp  =  swap_proc; 

>  203  while(onp)  {  /•  same  algorithm  used  in  dump_core  to  determine  swapname  •/ 

>  204  •rzp++  =  (tmp  «  10)  +  060; 

>  205  Tmp  f=  10; 
>206          } 

>  207  *rzp  =  0; 
>208 

>  209  tell_fs(CHDIR,  0.  2,  0);  ■  temporarily  switch  to  swap  dir  •/ 

>  210  fd  =  aUowed(name_buf,  4s_buf,  R_BIT);  «  is  file  readable?  •/ 

>  21 1  lellfsfCHDIR,  0,  F,  0);  /•  switch  back  to  MM's  own  directory  */ 

>  212  if  (fd  <  0)  { 
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>  213  princfCSWAP  FILE  not  rcadableO); 

>  214  return  (ERROR);          /•  file  wis  noi  readable  •/ 
>215  } 

>216 

>  217  !*  Fix  map  with  new  memory  and  tell  kernel.  */ 

>  218  gap_clicks  =  cmp->mp_»eg[Sljnem_phys  - 

>  219  (nnp->mp_seg[D].mem_phys  + 

>  220  mip->mp_tegrD]-mem_len); 

>  221  nnp->mp_»eg[T]jnem_phyi  =  new_hase; 

>  222  imp->mp_seg[D].mem_phyi  =  new  base  +  rmp->mp_»egrr|-niem_len; 

>  223  imp->mp_feg[S]jDcm_phys  =  rmp->mp_ieg[Dl.meni_phys  + 

>  224  nnp->mp_scg[D]jnem_kn  + 

>  225  gap_clicks; 

>  226  mip->mp  segr/T].mem_vir  =  0; 

>  227  rmp->mp_seg[D].mem_vir  =  0; 

>  228  rmp->mp_seg[S].mem_vir  =  rmp->mp_seg[D].mem_vir  + 

>  229  imp->mp_seg[D].mem_len  + 

>  230  gapclicks; 

>  231  sys_newmap(swap_proc,  rmp->mp_seg,  FALSE);       /*  report  new  map  lo  the  kernel  */ 
>232 

>  233  f*  Zero  the  gap  */ 

>  234  for  (rap  =  zb;  rap  <  &zb[ZEROBUF_SIZE];  rzp++)  *rzp  =  0;     /*  clear  buffer  •/ 

>  235  bytes  =  (phys_bytes)  gapclicks  «  CLJCK_SHIFT; 

>  236  vzb  =  (virbytes)  zb; 

>  237  base  =  flmg)  rmp->mp_seg[D].mem_phys  +  rmp->mp  seg[D].mem_len; 

>  238  base  =  base  «  CUCK~SfflFT; 
>239 

>  240  while  (bytes  >  0)  { 

>  241  count  =  (long)  MIN(bytes,  (phys_bytes)  ZEROBUFSIZE); 

>  242  if  (mem_copy(MM_PROC_NR,  D,  flong)  vzb,  ABS,  0,  base,  count)  1=  OK) 

>  243  panic("new_mem  can't  zero",  NO_NUM); 

>  244  base  +=  count; 

>  245  bytes  -=  count; 
>246  } 

>247 

>  248  I*  Read  in  text,  data  and  suck  segments.  */ 

>  249  load_seg(swap_proc,  fd,  T,  (physbytes)  rmp->mp_seg[T].mem_len  «  CUCK_SHIFT); 

>  250  toad_seg(swap_proc,  fd,  D,  (physbytes)  rmp->mp_seg[D].mem_len  «  CLICK_SHIFT); 

>  251  Ioad_seg(swap_proc,  fd,  S,  (physjsytes)  rmp->mp_seg[S].mem_len  «  CLICKSFUFT); 

>  252  dose(fd);                                  /*  don't  need  swap  file  any  more  */ 
>253 

>  254  tell_fs(CHDIR,  0,  2,  0);  /*  temporarily  switch  to  swap  dir  */ 

>  255  if(unlink(name_buf)  !=  0) 

>  256  printff  unlink  error:  %s0jiame_buf); 

>  257  tell_fs(CHDIR,  0,  1,0);            f*  switch  back  to  MM's  own  directory  •/ 
>258 

>  259  rctum(OK); 
>260  } 
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check_sig 


4  PRIVATE  inl  check_iig(proc_id,  ugjtr,  sendjiid) 

5  im  proc_id;  /*  pid  of  process  to  signal,  or  0  or  -1  */ 

6  inl  signr,  /*  which  signal  to  send  (1-16)  */ 

7  uid  lendjrid;  /•  identity  of  process  sending  the  signal  */ 

8  { 

9  /*  Check  to  see  if  it  is  possible  to  send  a  signal.  The  signal  may  have  lo  be 

10  *  sent  to  a  group  of  processes.   This  routine  is  invoked  by  the  KILL  system 

1 1  *  call,  and  also  when  the  kernel  catches  a  DEL  or  other  signal.  SIGALRM  too. 

12  •/ 
13 

14  register  struct  mproc  *rmp; 

15  tht  count,  send  tig; 

16  unshort  mask; 

17  extern  unshon  core_bits; 
18 

19  if  (signr  <  1  1 1  sig_nr  >  NRSIGS)  retumfETNVAL); 

20  count  =  0,  /*  count  #  of  signals  sent  •/ 

21  mask  =  1  «  (sig_nr  -  1); 
22 

23  I*  Search  the  proc  table  for  processes  to  signal.   Several  tests  are  made: 

24  *  -  if  proc's  uid  1=  sender's,  and  sender  is  not  superoser,  don't  signal 

25  *  -  if  specific  process  requested  (i.e.,  'procpid'  >  0),  check  for  match 

26  *  -  if  a  process  has  already  exited,  it  can't  receive  signals 

27  *  -  if  'proc_id'  is  0  signal  everyone  in  same  process  group  except  caller 

28  */ 

29  for  (rmp  =  AmpiocrrNTTraOCNR  +  1];  imp  <  &mproc[NR_PROCS];  imp++  )  { 

30  if  (  (imp->mp_flags  &  INFUSE)  =  0)  continue; 

31  #cnd_sig  =  TRUE;      /•  if  it's  FALSE  at  end  of  loop,  don't  signal  */ 

32  if  (sendjuid  1=  imp->mp_effuid  &&  sendjiid  !=  SUPER_USER)send_sig=FALSE; 

33  if  (proc_id  >  0  &&  procjd  1=  tmp->mp_pid)  send_sig  =  FALSE; 

34  if  (imp->mp_flags  &  HANGING)  sendsig  =  FALSE;     /*don't  wake  the  dead*/ 

35  if  (proc_id  =  0  &&  mp->mp_procgrp  1=  rmp->mp_procgrp)  send_sig  =  FALSE; 

36  if  (sendjiid  =  SUPER  USER  &.&.  proc_id  =  -1)  send  sig  =  TRUE; 
37 

38  /*  SIGALARM  is  a  little  special.   When  a  process  exits,  a  clock  signal 

39  *  can  arrive  just  as  the  timer  is  being  turned  off.   Also,  mm  off 

40  *  ALARM_ON  bit  when  timer  goes  off  to  keep  it  accurate. 

41  V 

42  if  (rigju  =  SIGALRM)  { 

43  if  (  (nnp->mp_flags  &.  ALARM_ON)  =  0)  continue; 

44  if  (sendsig)  imp->mp  flags  &=  ALARMON; 

45  } 
46 

47  if  (send_sig  =  FALSE)  continue; 

48  count++; 

49  if  (imp->mp  ignore  &.  mask)  continue; 
50 

51  #ifdef  AMKERNEL 

52  /*  see  if  an  amoeba  transaction  should  be  signalled  *./ 

53  Tf s  =  am_check_sig(nnp  -  mproc,  0); 
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54  #endif 

55 

>  56  /•  Send  the  signal  or  kill  the  process,  possibly  with  core  dump.  */ 

>  57  if(nnp->mp_flags  &  (SWAPPED  |  FKSWAPPED)) 

>  58  /»  ptoc  is  SWAPPED,  delay  processing  until  swapped  in  •/ 

>  59  nnp->mp_ssw_map  |=  1  «  (sig_nr  -1); 

>  60  else 

>  61  sig_proc(rmp,  sig_nr); 
62 

63  **  If  process  is  hanging  on  PAUSE,  WATT,  try,  pipe,  etc.  release  it.  V 

64  unpause{(intXrmp  -  mproc));       /*  check  to  see  if  process  is  paused  */ 

65  if  (proc_id  >  0)  break;  /•  only  one  process  being  signaled  */ 

66  } 
67 

68  /•  If  the  calling  process  has  killed  itself,  don't  reply.  */ 

69  if  ((mpomprlags  &  INJJSE)  =  0  |  |  (mp->mp_nags  &  HANGING))dont_reply  =TRUE; 

70  retum(count  >  0  7  OK  :  ESRCH); 

71  } 
72 

73 

74  i- 


75  *  do_pause 

76  ■ 


77  PUBLIC  inl  do_pause() 

78  { 

79  /*  Perform  the  pauseO  system  call.  */ 

80  struct  mproc  armp; 
81 

82  mp->mp  flags  |=  PAUSED;     f  rum  on  PAUSE  bit  •/ 

83  dontrepTy  =  TRUE; 
84 

>  85  for  (imp  -  *mprocriNrr_PROC_NR  +  1];  rmp  <  &mproc[NR_PROCS];  rmptt  )  ( 

>  86  if  (  (rmp->mp_flags  &  IN_USE)  =  0)  continue; 

>  87  if(  (rmp->mp_flags  4  (SWAPPED  |  FKSWAPPED))  && 

>  88  (rmp->mpHags  &  (PAUSED  |  WATTING)  =  0)  )  { 

>  89  reply(SWAP  TASK,  CORE  IS_EREE,  0,  NILPTR); 

>  90  break; 

>  91  } 

>  92  ) 

93  retum(OK); 

94  } 
95 

96 

97  r 

98  • 

99  * 


unpause 


100  PRIVATE  unpauselpro) 

101  in!  pro;  /•  which  process  number  */ 

102  { 

103  /*  A  signal  is  to  be  sent  to  a  process.  If  that  process  is  hanging  on  a 

104  *  system  call,  the  system  call  must  be  terminated  with  ETNTR.   Possible 

105  •  calls  are  PAUSE,  WAIT.  READ  and  WRITE,  the  latter  two  for  pipes  and  trys. 

106  •  First  check  if  the  process  is  hanging  on  PAUSE  or  WATT.  If  not,  tell  FS. 
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107 

108  •/ 

109 


•  io  it  can  check  for  READi  aid  WRTTEi  from  pipes,  ttys  and  the  like. 


110  register  struct  mproc  •rmp; 

HI 

112  rmp  =  &mproc[pro]; 

113 

114  /*  Check  to  tee  if  process  is  hanging  on  a  PAUSE  call.  •/ 

115  if  (  (rmp.>mp_f]ags  *  PAUSED)  Sl&  (imp->mp_flags  &  HANGING)  =  0)  ( 
H6  nnp->mp  flags  4=  PAUSED;    /•  mm  off  PAUSED  bit  •/ 

>  H'  if(rmp->mp_flags  &  (SWAPPED  |  FKSWAPPED))  { 

>  1 18  f*  PAUSED  swapped  proc  awakened,  delay  notice  &  tell  kernel  */ 

>  H9  tmp-Mnp  flags  |=WASPWS; 

>  12"  reply(SWAP  TASK.  CORE  IS  NEEDED,  pro.  NIL  PTR); 

>  121  }  else 

122  reply(pro,  EINTR,  0,  NILPTR); 

123  return; 

124  } 
125 

126  /*  Check  to  see  if  process  is  hanging  on  a  WAIT  call.  •/ 

127  if  (  (rmp->mp  flags  &  WATTING)  &&  (rmp-Jinp  flags  &  HANGING)  =  0)  { 

128  rmp->mp_flags  &=  WATTrNG;  /*  rum  off  WATTING  bit  */ 

>  129  if(rmp->mp  flags  &  (SWAPPED  |  FKSWAPPED))  { 

>  I30  /*  PAUSED  swapped  proc  awakened,  delay  notice  &  tell  kernel  */ 

>  131  rmp.>mp_flags  |=WASPWS; 

>  132  replyfSWAPTASK,  COREJSNEEDED,  pro,  NIL  PTR); 

>  133  )  else 

134  reply(pro,  ETNTR,  0,  NILPTR); 

135  return; 

136  } 
137 

138  #ifdef  AMKERNEL 

139  /*  if  it  was  an  amoeba  transaction,  it  is  already  tidied  up  by  now  •/ 

140  if  (Tfs) 

141  #endif 

142  /•  Process  is  not  hanging  on  an  MM  call.  Ask  FS  to  take  a  look.  •/ 

143  leIl_fs(UNPAUSE,  pro,  0,  0); 
144 

145  return; 

146  } 
147 

148 

149  r 

150  • 

151  - 

152  PUBLIC  dump_core(rmc,  rmp,  type) 

153  struct  mproc  *rmc;      /*  child  proc  for  swapout  */ 

154  struct  mproc  *imp;     I*  whose  core  is  to  be  dumped  */ 

155  inl  type;  /*  0-dump  core;  1-swapout;  2-sp  out  wmoadiusl  */ 

156  {  " 

157  I*  Make  a  core  dump  on  the  file  "core",  if  possible.  V 
158 

>  159  struct  slat  s_buf,  d_buf; 


dump_core 


Appendix  B-29  -  MODIFICATIONS  TO  MEMORY  MANAGER  CODE 


Jul  6  14:41  1989  SIGNALC  Page  4 


>  160  int  i,  r,  I,  new_fd,  slot,  dir,  flig,  byles; 

>  161  vir_byles  v_b«f,  c,  new_sp; 

>  162  ditr  'a; 

>  163  SUUCt  DiprOC  'imp; 

>  164  extern  char  core  namcl]; 

>  165  exlem  adjustf); 

>  166  char  swap_namel4]; 
>167 

>  168  ilot  =  (uilXtmc  -  mproc); 

>  169  /*  Change  to  wonting  directory  of  dumpee.  •/ 

>  170  iffjype  >  0)  { 

>  171  dir  =  0;/*  iwapout  */ 

>  172  tag  =  2; 

>  173  }  elie  { 

>  174  dir  =  riot;              /*  dump_core  •/ 

>  175  Sag  =  0; 

>  176  } 

>  177  leH_fs(CHDIR,  dir.  flag,  0); 

>  178 

>  179  *>  Can  file  be  written?  */ 

>  180  if(  (type  =  0)  &A  (rrnc->mp_realuid  1=  rmc->mp_effuid)  )  { 

>  181  teU_fs(CHDIR,  0,  1,  0);               /•  go  back  to  MM'i  directory  •/ 

>  182  ietum(ERROR); 

>  183  } 

>  184  if(type  =  0)  { 

>  185  imp  =  mp;                                              /•  allowedO  look!  at  'mp'  */ 

>  186  mp  =  imc; 

>  187  ) 

>  188 

>  189  ifftype  >  0)  { 

>  190  dir  =  riot; 

>  191  a  =  swapjiame; 

>  192  while(dir)  { 

>  193  »a++  =  (dir  *  10)  +  060; 

>  194  dir*=10; 
>195  } 

>  196  «a  =  0; 

>  197  r  =  allowedfswapname.  Asbuf,  W_BTT);              /*  is  iwap  file  writable  •/ 

>  198  }  else 

>  199  r  =  allowed(core_name.  &s_buf,  W_BIT);                /*  is  core  file  writable  •/ 
>200 

>  201  ■  ■  allowedr".  &d_buf,  WBIT);             /*  is  directory  writable?  */ 
>202 

>  203  if  (type  =  0) 

>  204  mp  =  xmp; 

>  205  if  (r  >=  0)  dose(r); 

>  206  if  (s  >=  0)  close(s); 

>  207  if  ((type  >  0)  |  |  (rmc->mp_effuid  =  SUPERUSER)) 

>  208  r  =  0;  /*  n  can  always  dump  core  •/ 
>209 

>  210  if  (s  >-  0  &&  (r  >-  0  |  |  r  =  ENOENT))  { 

>  21 1  f  Either  file  is  writable  or  il  doesn't  exist  A  dir  is  writable  V 

>  212  ifftype  >  0) 
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>  213  t-  creat(swap_name,  SWAP  MODE); 
>214                          else 

>  215  t  »  CTtal  (core  name.  COREMODE); 

>  216  tell  fl(CHDIR.  0,  1.  0);  /•  go  back  to  MM'l  own  dir  •/ 
>217                         if(r<0){ 

>  218  printfC  create  errorO); 

>  219  retum(ERROR); 
>220                          } 

>221 

>  222  if  (type  1=  2)  { 

>  223  /•  adjust  memory  mip,  if  neceiury  (already  done  for  type  2)  */ 

>  224  sys  _getsp(slot,  Anew_sp); 

*  225  if(adjusi(rmc,  (virclicks)  rmc-5snp_seg[DI.mem_len,  new  sp) 

>  226  1=  OK)  ( 

>  227  printfC  ADJUST  ERRORO); 

>  228  close(r); 

>  229  remm(ERROR); 
>230                               ) 

>231  } 

>232 

>  233  if(type  =  0)  { 

>  234  imc->mp  sigstarus  j  =  DUMPED; 
>235 

>  236  /•  First  write  the  memory  map  of  all  segments  on  core  file.  */ 

>  237  if  (writefr,  (char  •)  rmc->mp_seg,  sizeof(rmc->mp_seg))  <  0)  { 

>  238  close(r); 

>  239  printfC  write  memory  map  errorO); 

>  240  retum(ERROR); 
>241                                        } 

>242  ) 

>243 

>  244  ifftype  =  2) 

>  245  slot  =  (inl)  (tmp  -  mproc); 

>  246  t*  Now  loop  through  segments  and  write  the  segments  themselves  out.  */ 

>  247  for  (i  =  0;  i  <  NRSEGS;  i++)  { 

*  248  a  =  (char  *)  (rmc->snp_seg[i].mem_vir  «  CLICKSfflFr); 
»  249  c  =  (int)  (rmc>mp_seg[i].mem_len  «  CLICK  SHUT); 

*  250  newfd  =  ((intXtmp  -  mproc)  <<  8)  I  (i  «  6)  7  r; 
>251 

>  252  I*  Dump  segment.  */ 

>  253  while(c)  ( 

>  254  bytes  =  31  <  1024; 

»  255  if(c  <  (vir _bytes)bytes) 

>  256  byte,  .  („,,)£; 

>  257  if  (write(new_fd,  a,  bytes)  1=  bytes)  { 

>  258  close(r); 

>  2s9  prinrffwrite  segment  errorO); 

*  2*>  retum(ERROR); 
>261                                                            J 

>  262  a  +=  bytes; 

>  263  c  -=  bytes; 
>264                                           ) 

>265  ) 
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>266 

}elSe< 

>267 

printff  iwip  file  or  dir  ii  not  writeibleO); 

>268 

leU_fi(CHDIR,  0.  !,  0);               /»  go  tack  to  MM1! 

I  own  dir  */ 

>269 

do«e(r); 

>270 

i«um(ERROR): 

>271 

) 

>272 

>273 

do«e(r); 

>274 

retum(OK); 

>275 

} 
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1  /*  This  file  contains  the  ubtt  used  to  map  system  call  numbers  onto  the 

2  *  routines  that  perform  them. 

3  */ 
4 

5  #mclude  ".Vh/consLh" 

6  #include  ".7h/typeJi" 

7  ^include  "consUi" 
8 

9  #undef  EXTERN 

10  #defjne  EXTERN 
11 

12  #include  ".7h/callnrJi" 

13  #inciude  "glah" 

14  #indude  "mproch" 

15  #indude  "param.h" 
16 

17  f  Miscellaneous  */ 

18  char  core_namefl  =  {"core"};       /*  file  name  where  core  images  are  produced  */ 
>   19  char  swap_nameQ  =  {"xxxx"};    /*  swap  device  file  names  */ 

20  #ifdef  ATARIST 

21  /• 

22  *  Creating  core  files  is  disabled,  except  for  SIGQUTT  and  SIGIOT. 

23  *  Set  core  bits  to  OxOEFC  if  you  want  compatibility  with  UNIX  V7. 

24  */ 

25  unshort  corebits  ■  OxOEFC;       /*  which  signals  cause  core  images  */ 

26  #else 

27  unshort  core_bits  =  OxOEFC;       /*  which  signals  cause  core  images  */ 

28  #endif 
29 

30  extern  char  mm_sUck[]; 

31  char  •sUckpt  =  Aram  sUck[MM  STACK_BYTES];    /*  initial  stack  pointer  */ 
32 

33  extern  do_mm_exit0,  doforkO.  do_w«it(),  dobrkO.  do_getset0.  doexecO; 

34  extern  do_signal(),  dokillO,  do_pause0,  do_alarm(); 

»  35  extern  no  sysO.  do_ksig0,  do  brk20,  do_swin0.  do  swoutO; 
36 

37  #ifdef  AMJCERNEL 

38  extern  do_amoeba(); 

39  #endif 
40 

41  int  (*caJl_vec[NCALLS]X)  =  { 

42  no_sys,  /*  0  =  unused  */ 

43  do_mm_exii,  /*    1  =  exit  */ 

44  dofork,     I*   2  =  fork  */ 

45  no_sys,  /*  3  =  read  •/ 

46  no_sys,  /*  4  =  write  */ 

47  no_sys,  /*   5  =  open  */ 

48  no_sys,  /*  6  =  close  »/ 

49  dowait,     /*   7  =  wait  */ 

50  nosys.  I*    8  =  creal  */ 

51  no_sys.  /•   9  =  link  */ 

52  no_iys,  /*  10  =  unlink  •/ 

53  no_iys,  f*  11  =  exec  */ 
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54 

no_iyi. 

f  12  «  chair 

•/ 

55 

no_sys. 

/•  13  =  time 

•/ 

56 

no_sys. 

r  14  •  mknod 

•/ 

57 

no_iys. 

r  15  =  chmod 

'/ 

58 

no_iys. 

/*  16  =  chown 

•/ 

59 

do'brk. 

/*  17*  break 

'/ 

60 

no  sys. 

r  18  =  stal 

•/ 

61 

no_sys. 

r  19  =  Iseek 

*/ 

62 

do_getset 

,/*20 

=  gelpid              */ 

63 

no  syi. 

/*  21  =  mount 

V 

64 

no_iyi. 

I*  21  =  umount 

•/ 

65 

do_gelset 

./•13 

=  setuid             •/ 

66 

do_getset 

,/*24 

=  gcluui             */ 

67 

no_sys. 

/*  25  =  slime 

•/ 

68 

no_sys. 

r  26  •  (ptracc)'/ 

69 

do  alarm. 

/•27, 

=  alarm              •/ 

70 

no_sys. 

/*  28  =  fstal 

*/ 

71 

do_pause, 

rw 

=  pause             */ 

72 

no^sys. 

I*  30  =  utime 

V 

73 

no_sys. 

l"  31  =  (stty) 

*/ 

74 

no_sys. 

/•  32  =  (guy) 

*/ 

75 

no_iys. 

/*  33  =  access 

*/ 

76 

no  sys. 

/*  34  =  (nice) 

•/ 

77 

no  sys. 

/•  35  =  (fume) 

•/ 

78 

no  sys, 

/•  36  =  sync 

*/ 

19 

do'kill. 

m-. 

=  kill                 */ 

80 

no  sys. 

/•  38  =  unused 

*/ 

81 

no_sys. 

/*  39  =  unused 

•/ 

82 

no  sys. 

/*  40  =  unused 

•/ 

83 

no  sys. 

/"  41  =  dup 

*/ 

84 

no  sys, 

/•  42  =  pipe 

*/ 

85 

no_sys. 

/*  43  =  limes 

*/ 

86 

no_sys. 

/*  44  «  (proO 

"/ 

87 

no  sys. 

/*  45  =  unused 

•/ 

88 

do_getset. 

/»46: 

=  setgid             */ 

89 

do_getsel. 

/»47. 

=  gelgid             */ 

90 

do_signal,  1*  48  = 

■  sig*/ 

91 

no  sys. 

/•  49  =  unused 

•/ 

92 

no_sys. 

/*  50  =  unused 

•/ 

93 

no_sys. 

/*  51  =  (aca) 

•/ 

94 

no_sys. 

/•  52  =  (phys) 

•/ 

95 

no_sys. 

/•  53  =  (lock) 

•/ 

96 

no_iys. 

/»54=iocu 

*/ 

97 

no  sys. 

r*  55  =  unused 

•/ 

98 

no  sys. 

/*  56  =  (mpx) 

•/ 

99 

no_sys. 

/•  57  =  unused 

•/ 

100 

no  sys, 

/*  58  =  unused 

•/ 

101 

do_«iec. 

/*59  = 

i  exece              •/ 

102 

no_sys. 

/*  60  =  iimask 

*/ 

103 

no_sys. 

/*  61  =  chrool 

*/ 

104 

no_sys. 

1*  62  =  unused 

•/ 

105 

no_sys. 

/*  63  =  unused 

•/ 

106 
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107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
>120 
>  121 
122 
123 
124 
125 
126 


do  ksig,     /*  64  i 

no_tys, 

do_bit2.    /•«. 

no_«y«, 

no  lyi, 
#ifdef  i8088  " 
#ifdef  AMJCERNEL 

do_amoeba, 
#elje 

no_lys, 
#endif 
•aidif  18O88 
ttidef  SWAPER 

do_swin,    I*  70  : 

do_iwoul,  /•  71  : 


KSIG:  signals  originating  in  the  kernel  "/ 

/»65  =  UNPAUSE     •/ 

BRK2  (uied  to  Mil  MM  lize  of  FSJNIT)  •/ 
r  67  =  REVIVE        </ 
/•  68  =  TASK  REPLY  •/ 


/•  69  =  AMOEBA  SYSTEM  CALL  •/ 
/*  69  =  AMOEBA  SYSTEM  CALL  •/ 


#e]se 


#endif 

}; 


no_sys, 
no_sys. 


swap  in  */ 
swap  out  */ 

/*  70  =  swap  in  */ 
/*  71  =  swap  out  */ 
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1  - 

2  •  fsjnit                                                 * 

3  - 

4  PRIVATE  fsinilO 

5  ( 

6  P  Initialize  global  variables,  tables,  etc.  */ 
7 

8  register  struct  mode  *rip; 

9  inti; 

>  10  extern  struct  mode  *get_inode0.  •lwap_nouc; 
11 

12  bufjjoolO;  I*  tftituJize  buffer  pool  •/ 

13  Ioad_ram0;  f  Loss!  RAM  disk  from  root  diskette  */ 

14  k»d_super0;  *»  Load  super  block  for  root  device  */ 
15 

16  /*  Initialize  the  'fproc'  fields  for  process  0  and  process  2.  •/ 

17  for  (i  =  ft  i  <  3;  i+=  2)  ( 

18  fp  =  &fproc[i]; 

19  rip  •  get_inode(ROOT_DEV,  ROOTINODE); 

20  fp->fp_rooidir  =  rip; 

21  dup_inode(rip); 

22  fp->fp_workdir  =  rip; 

23  fp->fp_realuid  =  (uid)  SYSUID; 

24  fp->fp_effuid  =  (uid)  SYSUID; 

25  fp->fp_realgid  •  (gid)  SYSGID; 

26  rp->fp_effgid  •  (gid)  SYSGID; 

27  fp->fp_umask  =  0; 

28  ) 

>  29  swapjwde  =  NILJNODE; 
30 

31  /*  Certain  relations  must  hold  for  the  file  system  to  work  at  all.  */ 

32  if  (ZONE_NUM_SEE  b  2)  panicfZONE  NUMSEE  1=  2",  NONUM); 

33  if  (SUPER_SIZE  >  BLOCKSEE)  panic("SUPER_SEE  >  BLOCK_SEE" ,  NONUM); 

34  if(BLOCK_SEE  %  DJODESIZE  1=  0)panic("BLOCK  SEE  %  INODE  SEE  1=  0",  NO_NUM); 

35  if  (NR_FDS  >  127)  panicC'NR_FDS  >  127",  NONUM]; 

36  if  (NR_BUFS  <  6)  panic("NR_BUFS  K  6-  no_NUM): 

37  if  (iizeof(d  inode)  !=  32)  panieCinode  size  1=  32",  NO  NUM); 

38  ) 
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1  /*  This  file  contains  the  axle  for  performing  four  system  calls  relating  to 

2  *  (talus  and  directories. 

3  * 

4  *  The  entry  points  into  this  file  are 

5  *    dochdir  perform  the  CHDIR  system  call 

6  *    do_chroot:  perform  the  CHROOT  system  call 

7  *    do_stat:  perform  the  STAT  system  call 

8  *    dofstat:  perform  the  FSTAT  system  call 

9  */ 
10 

1 1  #include  ".Vh/consUi" 

12  #include  "../h/type.h" 

13  #include  "../h/error.h" 

14  #include  "../h/stat-h" 

15  #indude  "consUi" 

16  #include  "type.IT 

17  #indude  "filth" 

18  #indude  "fproc.h" 

19  #indude  "glo.h" 

20  #indude  "inode.h" 

21  #indude  "param.rT 

22  extern  struct  mode  *swap_node; 

23  char  iwap^dirH  =  {"Aisr/swap"}; 


24 

25  r 

26  *  do_chdir 

27  * 


28  PUBLIC  int  dochdirQ 

29  { 

30  /*  Change  directory.  This  function  is  also  called  by  MM  to  simulate  a  chdir 

31  "in  order  to  do  EXEC,  etc. 

32  •/ 
33 

34  register  struct  fproc  *rfp; 

35  intr; 
36 

37  if  (who  =  MM_PROC  NR)  { 

>  38  if(cd_flag  =  2)  { 

>  39  if(swap_node  =  NTLJNODE) 

>  40  if(  (r=change(&swap_node,  swapdir,  9))  !=  OK)  { 

>  41  retum(r); 

>  42  } 

>  43  dup_inode(swap_node); 

>  44  put_inode(fp->fp_workdir); 

>  45  fp->fp_workdir  =  swap  node; 

>  46  fp->fp_effuid  =  SUPER_USER; 

>  47  rcwm(OK); 

>  48 

>  49  }dse{ 

50  rfp  =  &i proclslot  I  |; 

51  put_inode(fp->fp_workdir); 

52  fp->fp_workdir  =  (cd  flag  =  1  ?  fp->fp_rootdir  :  rfp->fp_workdir); 

53  dup_inode(fp->fp_workdir); 
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54  fp->fp  effuid  =  (cdjiag  =  1  ?  SUPER  USER  :  rfp->fp  effuid); 

55  leturaCOK); 

56  } 

57  } 
58 

59  I*  Perform  the  chdir(namc)  syilem  call.  •/ 

60  return  change(&fp->fp_workdir,  name,  name  JenguY). 

61  } 
62 

63  r 

64  *  change  • 

65  « 

66  PRIVATE  int  change(iip,  namejxr,  len) 

67  struct  mode  **iip;  /*  pointer  to  the  inode  pointer  for  the  rhr  •/ 

68  char  *name_pir;  /*  pointer  to  the  directory  name  to  change  to  */ 

69  int  len;  I*  length  of  the  directory  name  string  */ 

70  I*  if  =  0,  then  use  name_ptr  directly  */ 

71  { 

72  /*  Do  the  actual  work  for  chdirO  and  chrootO-  */ 
73 

74  struct  inode  "rip; 

75  register  int  r; 

76  extern  struct  inode  "eat_path0; 
77 

78  I*  Try  to  open  the  new  directory.  */ 

79  r=0; 

>  80  if(cd_flag  =  2) 

>  81  <lol 

>  82  user_path[r]  =  namejjtrlr]; 

>  83  }  while  (name_ptr(r++]  1=  0); 
84 

85  else 

86  if  (fetch_name(name_ptr,  len,  M3)  !=  OK)  retum(err_code); 
87 

88  if  ( (rip  =  eat_paul(user_path))  =  NUINODE)  retum(en_code); 

89  I*  It  must  be  a  directory  and  also  be  searchable.  */ 

90  if  ( (rip->i_mode  &  ITYPE)  1=  IDIRECTORY) 

91  r  =  ENOTDIR; 

92  else 

93  r  =  forbidden(rip,  X_BIT,  0);       /•  check  if  dir  is  searchable  •/ 
94 

95  /*  If  error,  return  inode.  */ 

96  if  (r  1=  OK)  { 

97  pul_inode(rip); 

98  retum(r); 

99  } 
100 

101  f*  Everything  is  OK.  Make  the  change.  •/ 

102  puljnode(*iip);  I*  release  the  old  directory  */ 

103  *iip  =  rip;  /*  acquire  the  new  one  */ 

104  rehrm(OK); 

105  ) 
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